(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 




llllllllllllIUBIIIlllllllllllllllllUllillll 



(43) International Publication Date (10) International Publication Number 

15 February 2001 (15.02.2001) PCT WO 01/1 1061 A2 



(51) International Patent Classification 7 : C12N 15/82 (74) Agents: KINGWELL, Brian, G. et al.; Smart & Biggar, 

Vancouver Centre, Suite 2200, 650 W. Georgia Street, P.O. 

(21) International Application Number: PCT/CA00/00907 Box 1 1560, Vancouver, British Columbia V6B 4N8 (CA). 

(22) International Filing Date: 4 August 2000 (04.08.2000) 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 

(25) Filing Language: English AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CR, CU, CZ, 

DE, DK, DM, DZ, EE, ES, Fl, GB, GD, GE, GH, GM, HR, 

(26) Publication Language: English HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC LK, LR, 

LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, 
(30) Priority Data: NO , NZ, PL, PT, RO, RU, SD, SE, SG, SI. SK, SL, TJ, TM, 

60/147,133 4 August 1999 (04.08.1999) US TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW. 

(71) Applicant (for all designated States except US): THE 

UNIVERSITY OF BRITISH COLUMBIA [CA/CA]; < 84 ) Designated States (regional): ARIPO patent (GH, GM, 
University-Industry Liaison Office, IRC Building, Room KE ' ^ MW ' ^ SD > SL » sz » ^ UG » zw ^^ Eurasian 

331, 2194 Health Sciences Mall, Vancouver, British patent (AM, AZ, BY, KG, KZ,MD,RU,TJ, TM), European 

Columbia V6T 1Z3 (CA) P** 5111 < AT ' BE » CH » CY » DE » DK > ES * H » GB * GR - ^ 

IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, 

(72) Inventors; and CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 
(75) Inventors/Applicants (for US only): KUNST, Ljerka 

[CA/CA]; 2810 Newmarket Drive, North Vancouver, Published: 

British Columbia V7R 2T4 (CA). CLEMENS, Sabine — Without international search report and to be republished 



[DE/CA]; 4443 West 16th Avenue, Vancouver, British upon receipt of that report. 

Columbia V6R 3E7 (CA). 



[Continued on next page] 



(54) Title: REGULATION OF EMBRYONIC TRANSCRIPTION IN PLANTS 



CD 

CD 

CO 

? 

Q 
Q 

cr 

CD 

o 

_ _ o 

-250 TATTTTAAGT GGAAAACCGA AATTTTCCAT CGAAATATAT GAATTTAGTA 

1—1 

SO 



-950 


ACTCA 


TAAAAACTAG TAGATTGGTT GGTTGGTTTC CATGTACCAG 


-900 


AAGGCTTACC 


CTATTAGTTG 


AtproFM "> 
AAAGTTGAAA CTTTGTTCCC 


TACTCAATTC 


-650 


CTAGTTGTGT 


AAATGTATGT 


ATATGTAATG 


CGTATAAAAC 


GTAGTACTTA 


-800 


AATGACTAGG 


AGTGGTTCTT 


GAGACOGATG 


AGAGATGGGA 


GCAGAACTAA 


-750 


AGATGATGAC 


ATAATTAAGA 


ACGAATTTGA 


AAGGCTCTTA 


GGTTTGAATC 


-700 


CTATTCGAGA 


ATGTTTTTGT 


CAAAGATAGT 


GGCGATTTTG 


AACCAAAGAA 


-6S0 


AACATTTAAA 


AAATCAGTAT 


CCGGTTAGGT 


TCATGCAAAT 


AGAAAGTGGT 


-600 


CTAGGATCTG 


ATTGTAATTT 


TAGACTTAAA 


GAGTCTCTTA 


AGATTCAATC 


-550 


CTGGCTCTGT 


ACAAAACTAC 


AAATAATATA 


TTTTAGACTA 


TTTGGCCTTA 


-500 


ACTAAACTTC 


CACTCATTAT 


TTACTGAGGT 


TAGAGAATAG 


ACTTGCGAAT 


-450 


AAACACATTC 


CCGAGAAATA 


CTCATGATCC 


CATAATTAGT 


CAGAGGGTAT 


-400 


GCCAATCAGA 


TCTAAGAACA 


CACATTCCCT 


CAAATTTTAA 


TGCACATGTA 


-350 


ATCATAGTTT 


AGCACAATTC 


AAAAATAATG 


TAGTATTAAA 


GACAGAAATT 


-300 


TGTAGACTTT 


TTTTTGGCGT 


TAAAGGAAGA 


CTAAGTTTAT 


ACGTACATTT 


-250 


TATTTTAAGT 


GGAAAACCGA 


AATTTTCCAT 


CGAAATATAT 


GAATTTAGTA 


-200 


TATATATTTC 


TGCAATGTAC 


TATTTTGCTA 


TTTTGGCAAC 


TTTCAGTGGA 


-150 


CTACTACTTT 


ATTACAATGT 


GTATGGATGC 


ATGAGTTTGA 


GTATACACAT 


-100 


GTCTAAATGC 


ATGCTTTGCA 


AAACGTAACG 


GACCACAAAA 


GAGGATCCAT 


-50 


GCAAATACAT 


CTCATAGCTT 


CCTCCATTAT 


TTTCCGACAC 


AAACAGAGCA 


lfAT^VCGTCCC 


TTAACGTTAA 


<• 

GCTCCTT 







(57) Abstract: Nucleic acid constructs are provided comprising transcriptional regulatory regions homologous to plant FAE1 pro- 
Q moters. In some embodiments, these constructs may be used in transgenic cells or plants to promote expression of foreign and 
^ endogenous genes in developing seeds, for example to affect seed lipid metabolism, protein or carbohydrate composition and accu- 
^ mulation, or seed development. 
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REGULATION OF EMBRYONIC TRANSCRIPTION IN PLANTS 



FIELD OF THE INVENTION 

The invention is in the field of nucleic acid sequences capable of regulating 
5 transcription, particularly sequences that may promote transcription during embryogenesis in 
plants. 



BACKGROUND OF THE INVENTION 

Most of the information about seed-specific gene expression comes from studies of 

1 0 genes encoding seed storage proteins like napin, a major protein in the seeds of Brassica 

napus, or conglycinin of soybean. Upstream DNA sequences directing strong embryo-specific 
expression of these storage proteins have been used successfully in transgenic plants to 
manipulate seed lipid composition and accumulation (Voelker et aL 1996). However, 
expression of storage protein genes begins fairly late in embryogenesis. Thus, promoters of 

1 5 seed storage protein genes may not be ideal for all seed-specific applications. For example, 
storage oil accumulation commences significantly before the highest level of expression of 
either napin (Stalberg et aL, 1996) or conglycinin (Chen et aL 1988) is achieved. It is, 
therefore of interest to identify other promoters which may modulate expression of genes in 
developing plant embryos. 

20 A variety of transcriptional regulatory regions that may be active during plant 

embryogenesis are known, as disclosed for example in: U.S. Patent No. 5,792,922 issued 1 1 
August 1998 to Moloney; U.S. Patent No. 5,623 ? 067 issued 22 April 1997 to Vandekerckhove 
et ah: International Patent Publication W09845461 published 15 October 1998. There remains 
a need for alternative transcriptional regulatory regions. 

25 FATTY ACID ELONGATION! (FAE1) genes encode condensing enzymes involved in 

plant very long chain fatty acid biosynthesis. The FAE1 condensing enzyme is thought to be 
localized in the endoplasmic reticulum where it catalyzes the sequential elongation of CI 8 
fatty acyl chains to C22 in length (Kunst et al., 1992). FAEI genes have been cloned and 
described recently by James et al. (1995), International Patent Publication WO 96/13582. 

30 

SUMMARY OF THE INVENTION 

In one aspect, the invention provides transcriptional regulatory regions derived from 
FAEI genes. The transcriptional regulatory regions of the invention may be useful in 
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promoting early seed-specific transcription of heterologous sequences to which they are 
operably linked. The transcriptional regulatory regions of the invention may be used in a wide 
variety of plants, including Brassica sp., Arabidopsis and other plant species. DNA constructs 
comprising the transcriptional regulatory sequences of the invention may be active during 
5 fatty acid or lipid biosynthesis in the plant embryo. Certain embodiments of the constructs of 
the invention may be used in transgenic plants to promote expression of heterologous 
sequences in developing seeds. In various embodiments, the constructs of the invention may 
be used to mediate gene expression that affects seed lipid metabolism, or seed protein 
composition or seed carbohydrate composition, or seed development. In alternative 
10 embodiments, the transcriptional regulatory regions of the invention may also be useful for the 
production of modified seeds containing novel recombinant proteins which have 
pharmaceutical, industrial or nutritional value. 

BRIEF DESCRIPTION OF THE DRAWINGS 
15 Figure 1 shows a 934 bp DNA sequence comprising the Arabidopsis thaliana FAE1 

transcription regulatory sequence. 

Figure 2 shows a 1588 bp DNA sequence comprising the Brassica napus FAE1 
transcription regulatory sequence. 

Figure 3 shows a 1069 bp DNA sequence comprising the Lunaria annua FAE1 
20 transcription regulatory sequence. 

Figure 4 shows an alignment of the Arabidopsis thaliana (A J.), Lunaria annua (L.a.) 
and Brassica napus (B.n.) transcription regulatory sequences. Asterisks below the sequences 
indicate identical nucleotides in each of the three sequences. A number of putative cis-acting 
sequence motifs are identified in the A. thaliana sequence: an EMI ABA box at -44bp to - 
25 36bp having the sequence ACATCTCAT, for which the published consensus sequence is 
ACGTGTCAT (Rowley, D.L. and Herman, E.M. (1997), Biochimica et Biophysica Acta 
1345:1-4); an A-300 box at -51bp to -46bp having the sequence TGCAAT, for which the 
published consensus sequence is TG(T/A/C)AAA(G/T) (Morton et al. (1994) in Seed 
Development and Germination (Kigel, J. and Gallili, G., eds.) pp. 103-138. Marcel Dekker. 
30 New York); G-box 1 at -105 to -100 bp having the sequence CACATG, for which is the 
consensus sequence is CACCTG, and G-box 2 at -164 to -159 bp having the sequence 
CAACTT, for which the consensus sequence is CAACTG (Kawogoe, Y. and Murai, N. 
(1992) Plant J. 2:927-936; CE1 element at -226 to -218 bp having the sequence 
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TTCCATCGA. for which the consensus sequence is TGCCACCGG, and a CE3 element at - 
381bp to -369 bp having the sequence ACACATTCCCTC, for which the consensus sequence 
is ACGCGTGTCCTC (Shen et ah, (1996) Plant Cell 8:1 107-1 119). Not highlighted is a 
putative RY repeat motif at -53bp to -47bp having the sequence CATGCAA, for which the 
5 consensus sequence is CATGCAT (Dickinson et al. (1988) Nucleic Acid Res. 16:371; 
Lelievre et al. (1992) Plant Physiol. 98:387-391). Also shown, as Con. 4, is a consensus 
sequence, wherein R=G or A, Y=T/U or C, M=A or C, K=G or T/U, S=G or C, W=A or TAJ, 
B-G or C or T/U, D=A or G or T/U, H=A or C or T/U, V=A or G or C and N=A or G or C or 
T/U. 

10 Figure 5 shows an alignment of the Arabidopsis thaliana (A J.) and Lunaria annua 

(La.) transcription regulatory sequences. Asterisks below the sequences indicate identical 
nucleotides in each of the two sequences. The base at position -400 in the At. sequence is 
highlighted. The alignment of sequences in both Figure 4 and Figure 5 was accomplished 
using the CLUSTALW program (version 1.74) for multiple sequence alignments, using a gap 

15 open penalty of 15, a gap extension penalty of 6.66 and an IUB DNA weight matrix. Also 
shown, as Con. 5, is a consensus sequence, wherein R=G or A, Y=T/U or C, M=A or C, K=G 
or T/U, S=G or C, W=A or T/U, B=G or C or T/U, D=A or G or T/U, H=A or C or T/U, V=A 
or G or C and N=A or G or C or T/U. 

Figure 6 includes two bar graphs illustrating hydroxy fatty acid content of A) FAE1- 

20 FAH1 2 and B) napin-FAH12 transgenic seeds, expressed as percentage of total seed fatty 
acids. 

Figure 7 shows an alignment of the Brassica napus (B.n.) and Lunaria annua (L.a.) 
FEA1 transcription regulatory sequences. Asterisks below the sequences indicate identical 
nucleotides in each of the two sequences. 
25 Figure 8 shows an alignment of the Brassica napus (B.n.) and Arabidopsis thaliana 

(A J.) FEA1 transcription regulatory sequences. Asterisks below the sequences indicate 
identical nucleotides in each of the two sequences. 

DETAILED DESCRIPTION OF THE INVENTION 

30 The recombinant nucleic acid molecules of the invention may comprise a heterologous 

promoter sequence operably linked to a nucleic acid sequence, wherein the promoter 
sequence comprises a transcriptional regulatory region capable of mediating seed-specific 
expression in Arabidopsis. The transcriptional regulatory region may be obtainable from a 
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plant FAE1 gene. Alterntively, The transcriptional regulatory region may hybridize under 
stringent conditions to a 5' region of the plant FAE1 gene. In further alternative 
embodiments, The transcriptional regulatory region may be at least 70% identical when 
optimally aligned to the 5 1 region of the plant FAE1 gene. 
5 In alternative embodiments, the invention provides isolated nucleic acids comprising 

the transcriptional regulatory regions of the invention. By isolated, it is meant that the 
isolated substance has been substantially separated or purified away from other biological 
components with which it would otherwise be associated, for example in vivo. The term 
'isolated' therefore includes substances purified by standard purification methods, as well as 
10 substances prepared by recombinant expression in a host as well as chemically synthesized 
substances. 

In the context of the present invention, "transcriptional regulatory region" means a 
nucleotide sequence capable of mediating or modulating transcription of a nucleotide 
sequence of interest, when the transcriptional regulatory region is operably linked to the 

! 5 sequence of interest. Conversely, a transcriptional regulatory region and a sequence of interest 
are "operably linked" when the sequences are functionally connected so as to permit 
transcription of the sequence of interest to be mediated or modulated by the transcriptional 
regulatory region. In some embodiments, to be operably linked, a transcriptional regulatory 
region may be located on the same strand as the sequence of interest. The transcriptional 

20 regulatory region may in some embodiments be located 5* of the sequence of interest. In such 
embodiments, the transcriptional regulatory region may be directly 5' of the sequence of 
interest or there may be intervening sequences between these regions. The operable linkage of 
the transcriptional regulatory region and the sequence of interest may require appropriate 
molecules (such as transcriptional activator proteins) to be bound to the transcriptional 

25 regulatory region, the invention therefore encompasses embodiments in which such molecules 
are provided, either in vitro or in vivo. 

The term "recombinant" means that something has been recombined. so that when 
made in reference to a nucleic acid molecule the term refers to a molecule that is comprised of 
nucleic acid sequences that are joined together by means of molecular biological techniques. 

30 The term "recombinant" when made in reference to a protein or a polypeptide refers to a 
protein molecule which is expressed using a recombinant nucleic acid molecule. The term 
"heterologous" when made in reference to a nucleic acid sequence refers to a nucleotide 
sequence which is ligated to, or is manipulated to become ligated to, a nucleic acid sequence 
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to which it is not ligated in nature, or to which it is ligated at a different location in nature. The 
term "heterologous" therefore indicates that the nucleic acid molecule has been manipulated 
using genetic engineering, i.e. by human intervention. 

Sequences may be derived or obtainable from plant FAE1 genes by deduction and 
5 synthesis based upon the wild-type FAE1 gene sequences. Derived sequences may be 
identified in different organisms, for example by isolation using as probes the nucleic acid 
sequences of the invention. Alternative transcriptional regulatory regions may be derived 
through mutagenesis or substitution of wild-type sequences, such as the sequence disclosed 
herein. Derived nucleic acids of the invention may be obtained by chemical synthesis, 
1 0 isolation, or cloning from genomic DNAs using techniques known in the art, such as the 

Polymerase Chain Reaction (PCR). Consensus sequences, such as those illustrated in Figures 
4 and 5 are alternative embodiments of the nucleic acids of the invention, derived from the 
disclose wild-type FAE1 gene sequences. Nucleic acids of the present invention may be used 
to design alternative primers (probes) suitable for use as PCR primers to amplify particular 

15 regions of an FAE1 gene. Such PCR primers may for example comprise a sequence of 15-20 
consecutive nucleotides of the sequences of the invention. To enhance amplification 
specificity, primers of 20-30 nucleotides in length may also be used. Methods and conditions 
for PCR amplification are described in Innis et al. (1990); Sambrook et al. (1989); and 
Ausubel et al. (1995). As used herein, the term "probe" when made in reference to an 

20 oligonucleotide refers to an oligonucleotide which is capable of hybridizing to another 

oligonucleotide of interest. A probe may be single-stranded or double-stranded. Probes are. for 
example, useful in the detection, identification, amplification and isolation of particular gene 
sequences. Oligonucleotide probes may be labelled with a "reporter molecule." so that the 
probe is detectable using a detection system, such as enzymatic, fluorescent, radioactive or 

25 luminescent detection systems. 

Derived nucleic acids of the invention may also be identified by hybridization, such as 
Southern or Northern analysis. Southern analysis is a method by which the presence of DNA 
sequences in a target nucleic acid mixture are identified by hybridization to a labeled probe, 
comprising an oligonucleotide or DNA fragment of a nucleic acid of the invention. Probes for 

30 Southern analysis may for example be at least 15 nucleotides in length. Southern analysis 
typically involves electrophoretic separation of DNA digests on agarose gels, denaturation of 
the DNA after electrophoretic separation, and transfer of the DNA to nitrocellulose, nylon, or 
another suitable membrane support for analysis with a radiolabeled, biotinylated, or enzyme- 
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labeled probe as described in Sambrook et aL (1989). Similarly. Northern analysis may be 
used to identify RNAs that hybridize to a known probe such as an oligonucleotide, DNA 
fragment cDNA or fragment thereof, or RNA fragment of a nucleic acid of the invention or a 
known FAE1 sequence. The probe may be labeled with a radioisotope such as 32 P, by 
5 biotinylation or with an enzyme. The RNA to be analyzed may be electrophoretically 
separated on an agarose or polyacrylamide gel, transferred to nitrocellulose, nylon, or other 
suitable membrane, and hybridized with the probe, using standard techniques well known in 
the art such as described in Sambrook et aL (1989). 

In alternative embodiments, a transcriptional regulatory region of the invention may 

10 be at least 70% identical when optimally aligned to the 5' region of a plant FAE1 gene, such 
as the Arabidopsis FAE1 gene. In alternative embodiments, the degree of identity may be 
between 50% and 100%, such as 60%, 80%, 90%, 95% or 99%. When a position in the 
compared sequence is occupied by the same nucleotide or amino acid, following optimal 
alignment of the sequences, the molecules are considered to have identity at that position. The 

1 5 degree of identity between sequences is a function of the number of matching positions shared 
by the sequences. In terms of percentage, identity is the sum of identical positions, divided by 
the total length over which the sequences are aligned, multiplied by 1 00. 

Various aspects of the present invention encompass nucleic acid or amino acid 
sequences that are homologous to other sequences. As the term is used herein, an amino acid 

20 or nucleic acid sequence is "homologous" to another sequence if the two sequences are 

substantially identical and the functional activity of the sequences is conserved (for example, 
both sequences function as or encode a FAE1 enzyme: as used herein, the term 'homologous' 
does not infer evolutionary relatedness). Nucleic acid sequences may also be homologous if 
they encode substantially identical amino acid sequences, even if the nucleic acid sequences 

25 are not themselves substantially identical, a circumstance that may for example arise as a 
result of the degeneracy of the genetic code. 

Two amino acid or nucleic acid sequences are considered substantially identical if, 
when optimally aligned (with gaps permitted), they share at least about 50% sequence 
similarity or identity, or if the sequences share defined functional motifs. In alternative 

30 embodiments, sequence similarity in optimally aligned substantially identical sequences may 
be at least 60%, 70%, 80%, 90% or 95%. As used herein, a given percentage of homology 
between sequences denotes the degree of sequence identity in optimally aligned sequences. 
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Optimal alignment of sequences for comparisons of similarity may be automated using 
a variety of algorithms, such as the local homology algorithm of Smith and Waterman (1981) 
Adv. Appl Math 2: 482, the homology alignment algorithm of Needleman and Wunsch (1970) 
J. Mol Biol. 48:443, the search for similarity method of Pearson and Lipman (1988) Proc. 
5 Natl Acad. Set USA 85: 2444, and the computerized implementations of these algorithms 
(such as GAP, BESTFIT, FASTA and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, Madison, WI, U.S.A.). Sequence similarity may also be 
determined using the BLAST algorithm, described in Altschul et al (1990), 1 Mol Biol 
215:403-10 (using the published default settings). Software and instructions for performing 
1 0 BLAST analysis may be available through the National Center for Biotechnology Information 
in the United States (including the programs B LAS TP, BLASTN, BLASTX, TBLASTN and 
TBLASTX that may be available through the internet at http://www.ncbi.nlm.nih.gov/) . The 
BLAST algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying 
short words of length W in the query sequence that either match or satisfy some positive- 
1 5 valued threshold score T when aligned with a word of the same length in a database 
(reference) sequence. T is referred to as the neighborhood word score threshold. Initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs. The word hits 
are extended in both directions along each sequence for as far as the cumulative alignment 
score can be increased. Extension of the word hits in each direction is halted when the 
20 following parameters are met: the cumulative alignment score falls off by the quantity X from 
its maximum achieved value; the cumulative score goes to zero or below, due to the 
accumulation of one or more negative-scoring residue alignments: or the end of either 
sequence is reached. The BLAST algorithm parameters W. T and X determine the sensitivity 
and speed of the alignment. The BLAST program may use as defaults a word length (W) of 
25 11, the BLOSUM62 scoring matrix (Henikoff and Henikoff (1992) Proc. Natl Acad. Sci. USA 
89: 10915-10919), a gap existence cost of 1 1, a per residue gap cost of 1, a lambda ratio of 
0.85, alignments (B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of both 
strands. One measure of the statistical similarity between two sequences using the BLAST 
algorithm is the smallest sum probability (P(N)), which provides an indication of the 
30 probability by which a match between two nucleotide or amino acid sequences would occur 
by chance. In alternative embodiments of the invention, nucleotide or amino acid sequences 
are considered substantially identical if the smallest sum probability in a comparison of the 
test sequences is less than about 1, preferably less than about 0.1, more preferably less than 
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about 0.0 1, and most preferably less than about 0.001. In the PSI-BLAST implementation of 
the BLAST algorithm, an expect value for inclusion in PSI-BLAST iteration may be 0.001 
(Altschul et al. (1997), Nucleic Acids Res. 25:3389-3402). Searching parameters may be 
varied to obtain potentially homologous sequences from database searches. 
5 An alternative indication that two nucleic acid sequences are substantially identical is 

that the two sequences hybridize to each other under moderately stringent, or preferably 
stringent, conditions. Hybridization to filter-bound sequences under moderately stringent 
conditions may, for example, be performed in 0.5 M NaHP04, 7% sodium dodecyl sulfate 
(SDS), 1 mM EDTA at 65EC and washing in 0.2 x SSC/0.1% SDS at 42EC (see AusubeL et 

10 al. (eds), 1989, Current Protocols in Molecular Biology, Vol. 1, Green Publishing Associates, 
Inc., and John Wiley & Sons, Inc., New York, at p. 2.10.3). Alternatively, hybridization to 
filter-bound sequences under stringent conditions may, for example, be performed in 0.5 M 
NaHP0 4 , 7% SDS, 1 mM EDTA at 65EC, and washing in 0.1 x SSC/0.1% SDS at 68EC (see 
AusubeL et al. (eds), 1989, supra). Hybridization conditions may be modified in accordance 

1 5 with known methods depending on the sequence of interest (see Tijssen, 1 993, Laboratory 
Techniques in Biochemistry and Molecular Biology — Hybridization with Nucleic Acid 
Probes, Part I, Chapter 2 "Overview of principles of hybridization and the strategy of nucleic 
acid probe assays", Elsevier, New York). Generally, stringent conditions are selected to be 
about 5EC lower than the thermal melting point for the specific sequence at a defined ionic 

20 strength and pH. 

A FAE1 promoter is any naturally occurring transcriptional regulatory region that 
mediates or modulates the expression of a plant FAE1 condensing enzyme. Plant FAE1 
condensing enzymes are proteins that are homologous to known FAE1 condensing enzymes, 
such as those cloned and described in International Patent Publication WO 96/13582. 

25 Heterologous DNA sequences may for example be introduced into a host cell by 

transformation. Such heterologous molecules may include sequences derived from the host 
cell species, which have been isolated and reintroduced into cells of the host species. 
Heterologous nucleic acid sequences may become integrated into a host cell genome, either as 
a result of the original transformation of the host cells, or as the result of subsequent 

30 recombination events. Transformation techniques that may be employed include plant cell 
membrane disruption by electroporation, microinjection and polyethylene glycol based 
transformation (such as are disclosed in Paszkowski et al EMBOJ. 3:2717 (1984); Fromm et 
al., Proc. Natl Acad. Scl USA 82:5824 (1985); Rogers et al, Methods EnzymoL 1 18:627 
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(1986); and in U.S. Patent Nos. 4,684,61 1; 4,801,540; 4,743,548 and 5.231,019), biolistic 
transformation such as DNA particle bombardment (for example as disclosed in Klein, et al, 
Nature 327: 70 (1987); Gordon-Kamm, et al. "The Plant Cell" 2:603 (1990); and in U.S. 
Patent Nos. 4,945,050; 5,015,580; 5,149,655 and 5,466,587); Agrobacterium-mediaXed 
5 transformation methods (such as those disclosed in Horsch et al. Science 233 : 496 (1 984); 
Fraley etaL Proc. Nat 'I Acad Sci. USA 80:4803 (1983); and U.S. Patent Nos. 4,940,838 and 
5,464,763). 

Standard methods are available for the preparation of constructs for use in identifying 
and characterizing transcriptional regulatory regions useful in various embodiments of the 
1 0 invention. General molecular techniques may for example be performed by procedures 

generally described by Ausubel FM, Brent R, Kingston RE, Moore DD, Seidman JG, Smith 
JA, Stuhl K. (1995) Current Protocols in Molecular Biology, Vols 1, 2 and 3. Alternative 
equivalent methods or variations thereof may be used in accordance with the general 
knowledge of those skilled in this art and the functional requirements of the present invention. 

1 5 In some aspects of the invention, transformed plant ceils may be cultured to regenerate 

whole plants having a transformed genotype and displaying a desired phenotype, as for 
example modified by the expression of a heterologous protein mediated by a transcriptional 
regulatory region of the invention. A variety of plant culture techniques may be used to 
regenerate whole plants, such as are described in Gamborg and Phillips, "Plant Cell. Tissue 

20 and Organ Culture, Fundamental Methods", Springer Berlin, 1995); Evans et al. "Protoplasts 
Isolation and Culture'', Handbook of Plant Cell Culture, Macmillian Publishing Company, 
New York, 1983: or Binding, "Regeneration of Plants, Plant Protoplasts", CRC Press, Boca 
Raton. 1985: or in Klee et aL Ann. Rev. of Plant Phys. 38:467 (1987). A cell, tissue, organ, or 
organism into which has been introduced a foreign nucleic acid, is considered "transformed", 

25 "transfected", or '"transgenic". A transgenic or transformed cell or organism also includes 
progeny of the cell or organism and progeny produced from a breeding program employing a 
transgenic plant as a parent in a cross and exhibiting an altered phenotype resulting from the 
presence of a recombinant nucleic acid construct. A transgenic plant is therefore a plant that 
has been transformed with a heterologous nucleic acid, or the progeny of such a plant that 

30 includes the transgene. The invention provides vectors, such as vectors for transforming plants 
or plant cells. The term "vector" in reference to nucleic acid molecule generally refers to a 
molecule that may be used to transfer a nucleic acid segment(s) from one cell to another. One 
of skill will recognize that after the nucleic acid is stably incorporated in transgenic plants and 
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confirmed to be operable, it can be introduced into other plants by sexual crossing. Any of a 
number of standard breeding techniques may be used, depending upon the species to be 
crossed. 

In various embodiments, the invention comprises plants transformed with the nucleic 
5 acids of the invention. In some embodiments, such plants will exhibit altered fatty acid 
content in one or more tissues. These aspects of the invention relate to all higher plants, 
including monocots and dicots, such as species from the genera Fragaria. Lotus, Medicago, 
Onobrychis, Triforium, Trigonelia, Wgna, Citrus, Linum. Geranium, Manihou Caucus, 
Arabidopsis, Brassica, Raphanus, Sinapis. Atropa, Capsicum, Hyoscyamus. Lycopersicon, 

10 Nicotiana, Solarium, Petunia, Digitalis, Majorana. Cichorium. Helianthus. Lactuca. Bromus, 
Asparagus, Antirrhinum, Heterocatlis, Nemesia, Pelargonium, Panicum, Penniserum. 
Ranunculus, Senecio. Salpiglossis. Cucarnis, Browallia, Glycine, Lolium, Zea, Triticum, 
Sorghum, and Datura. Such plants may include maize, wheat, rice, barley, soybean, beans, 
rapeseed, canola, alfalfa, flax, sunflower, cotton, clover, lettuce, tomato cucurbits, potato 

15 carrot, radish, pea lentils, cabbage, broccoli, brussel sprouts, peppers, apple, pear, peach. 

apricot carnations and roses. More specifically, in alternative embodiments, plants for which 
the invention may be used in modifying fatty acid content include oil crops of the Cruciferae 
family: canola, rapeseed {Brassica spp.), crambe (Crambe spp.), honesty (Lunaria spp.) 
lesquerella (Lesquerela spp.), and others; the Composirae family: sunflower (Helianthus 

20 spp.), safflower (Carthamus spp.), niger (Guizotia spp.) and others; the Palmae family: palm 
(Elaeis spp.), coconut (Cocos spp.) and others; the Leguminosae family: peanut (Arachis 
spp.), soybean (Glycine spp.) and others: and plants of other families such as maize (Zea spp.), 
cotton (Gossvpiun sp.), jojoba (Simonasia sp.), flax (Linum sp.), sesame (Sesamum spp.), 
castor bean (Ricinus spp.), olive (Olea spp.), poppy (Papaver spp.), spurge (Euphorbia, spp.), 

25 meadowfoam (Limnanthes spp.), mustard (Sinapis spp.) and cuphea (Cuphea spp.). 

Nucleic acids of the invention may also be used as a plant breeding tool, as molecular 
markers to aid in plant breeding programs. Such techniques would include using the gene 
itself as a molecular probe or using the DNA sequence to design PCR primers to use PCR 
based screening techniques in plant breeding programs. 

30 Deletion or insertion constructs may be useful for domain mapping to determine the 

functional domains or motifs of a transcriptional regulatory region derived from a FAE1 gene. 
An aspect of the invention is the construction and testing of such constructs, as described 
below for the 5' deletion construct of the A. thaliana FAE1 5' region. One aspect of the 
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invention comprises transcriptional regulatory regions that are derived from functionally 
important regions of a FAE1 promoter. As outlined above, the functionally important regions 
of a FAE1 promoter may be determined through routine assays. Alternatively, randomly 
selected portions of a a FAE1 promoter may be selected for use in routine assays to determine 
5 whether the selected region is capable of functioning as a transcriptional regulatory region in 
the context of the present invention. In various embodiments, regions of the Arabidopsis 
thaliana, Brassica napus or Lunaria annua promoters may be used. For example, the 
following motifs in the A J. FAE1 promoter may be used alone or in combination in novel 
transcriptional regulatory regions (see Figure 4): the CE-like elements (CE1 and CE3), the RY 

10 repeat motif, the G-boxes (G-boxl and G-box2), the A-300 box, the EMI ABA box. or the 
CTATTTTG element. Constructs of the invention comprising such motifs, deletions or 
insertions may be assayed for activity as transcriptional regulatory regions of the invention by 
testing for strong seed-specific activity providing expression of a sequence of interest (such as 
a reporter sequence) before the torpedo stage and persisting throughout embryo development. 

1 5 in accordance with standard testing methods that may be adapted from the methods disclosed 
herein. 

Alternative embodiments of the transcriptional regulatory regions of the invention may 
be identified using information available through NCBI databases at http://www.ncbi.nih.gov. 
In various embodiments, transcriptional regulatory regions derived from plant FAE1 
20 genes are shown to be capable of directing expression of desired genes at an early stage of 
development in a seed-specific manner in disparate plant species. In particular embodiments, 
the transcriptional regulatory regions of the invention may be used in a wide variety of 
dicotyledonous plants for modification of the seed phenotype. For example, new seed 
phenotypes may include: 
25 ( 1 ) altered seed fatty acid composition or seed oil composition and accumulation 

(2) altered seed protein or carbohydrate composition or accumulation 

(3) enhanced production of desirable endogenous seed products 

(4) suppression of production of undesirable gene products using antisense. co-suppression or 
ribozyme technologies 

30 (5) production of novel recombinant proteins for pharmaceutical, industrial or nutritional 
purposes 

Isolation of a seed-specific promoter from A. thaliana 
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Using the sequence information of the A. thaliana genome sequencing project, 
synthetic oligonucletide primers were designed to amplify the FAEI 5' untranslated region, to 
isolate it by PCR. As shown in Figure 1, the upstream primer 5'- 
CTAGTAGATTGGTTGGTTGGTTTCC-3 ' (AtproFW) in combination with the downstream 
5 primer 5 , -TGCTCTGTTTGTGTCGGAAAATAATGG-3 , (AtproRV) were used, and resulted 
in the synthesis of a fragment of the correct size (934 bp). The amplified product was 
subcloned in the Hindi site of the plasmid pT7T3-18U (Pharmacia) to produce plasmid 
pT7T3-18U/proFAE900, followed by complete sequence determination of both strands to 
verify the fragment identity. A BLAST search of the A. thaliana Database identified a single 
10 BAC clone T4L20 (GenBank ATF10M6) 125,179 bp long, which contains the complete 
FAE1 gene. 

Functional analysis of the FAEI 5' upstream region 

5' upstream fragments of the FAEI gene were shown to confer seed-specific and 
temporally regulated gene expression in plants. A translational fusion was made between the 

15 FAEI 5' region and the coding region of the reporter gene (3-glucuronidase (GUS). The 
chimeric gene (pFAE900-GUS or pFAE400-GUS) was transferred into Arabidopsis and 
tobacco and GUS activity was monitored in various tissue of transgenic plants. 

Construction of the vectors pFAE900-GUS and pFAE400-GUS, and transformation of 
Arabidopsis and tobacco, was as follows. The insert was cleaved out of pT7T3-18U vector 

20 with HindUl and Xbal and directionally subcloned into the corresponding sites of the binary 
Ti plasmid pBHOl (Clontech), which contains a promoterless GUS gene (Jefferson et al. 
1987), to obtain the vector pFAE900-GUS. Another construct, pFAE400-GUS. containing 
only 393 bp of the 5 ? FAEI region directly upstream of the ATG initiation codon fused to the 
GUS coding sequence was also generated. For that, the pT7T3-18U/proFAE900 vector was 

25 digested with BgUI and Pstl, the sticky ends were filled in using T4 DNA polymerase, 
followed by re-ligation to obtain pT7T3-18U/proFAE400. The 393 bp 5' FAEI upstream 
fragment was then excised with Hindlll and Xbal and cloned into the binary vector pBHOl to 
obtain the plasmid pFAE400-GUS. The pFAE400-GUS and pFAE900-GUS fusion constructs 
in pBHOl were introduced into Agrobacterium tumefaciens strain GV3101 (Koncz and 

30 Schell, 1986) by heat-shock and selected for resistance to kanamycin (50 jig/ml). A. thaliana 
(L.) Heynh. ecotype Columbia was transformed with the pFAE400-GUS and pFAE900-GUS 
constructs using floral dip method (Clough and Bent, 1998). Screening for transformed seed 
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was done on 50|4,g/mL kanamycin as described previously (Katavic et al., 1994). 
Approximately 100 transgenic lines were generated for each construct. 

For transformation of tobacco, A tumefaciens harbouring the pFAE900-GUS construct 
was co-cultivated with leaf pieces of Nicotiana tabacum SRI and transformants were selected 
5 with kanamycin (lOO^ig/mL) on solid medium (Lee and Douglas, 1996). 

Histochemical localization of GUS activity and analysis of transgenic plants was as 
follows. Tissue sections were placed in 100 mM NaP0 4 (pH7) and 1 mM spermidine for 15 

min, then incubated at 37° C in 0.5 K 3 [Fe(CN) 6 ], 0.01 % Triton X-100, ImM EDTA, 10 mM 
P-mercaptoethanol, 5-bromo-4-chloro-3-indolyl-P-D-glucuronide in 100 mM NaP0 4 (pH7), 

10 until a blue color appeared (after approximately 1 hr). Following incubation with the substrate, 
chlorophyll was removed from the sections using a graded ethanol series. 

Using this assay, five independent transgenic Arabidopsis lines were examined for the 
embryo-specific expression of the GUS gene. In addition, leaf stem and siliques were 
histochemically stained for p-glucuronidase activity. The results indicate that the reporter 

15 gene fused to the transcriptional regulatory region of the invention is not expressed in 
vegetative tissues, whereas it is highly expressed in developing seeds (embryos). Both the 934 
bp and the 393 bp transcriptional regulatory regions derived from the AA. FAE1 gene caused 
the appearance of GUS activity by the torpedo stage embryo (6 days after flowering). GUS 
activity in all five lines persisted throughout subsequent embryo development. 

20 Leaves, stems, pods and seeds of three regenerated tobacco lines transformed with the 

pFAE900-GUS construct were also assayed for ^-glucuronidase activity. The results obtained 
indicate that the 934 bp FAE1 promoter fragment contains sufficient information to direct 
seed-specific expression of a reporter gene in transgenic tobacco. Thus the transcriptional 
regulatory regions of the invention may be used for seed-specific expression of foreign genes 

25 in transgenic plants. 

The in vivo activity of a FAE1 promoter of the invention was compared to the activity 
of the napin promoter by expressing the castor bean hydroxylase gene FAH12 (Broun and 
Somerville, 1997) behind either the FAE1 -promoter (a transcriptional regulatory region of 
approximately 1 kb) or the napin promoter in an Arabidopsis fad2/fael double mutant. This 

30 mutant accumulates as a proportion of fatty acids about 85% of the 18:1 acyl group, which is 
the substrate for the hydroxylase. The levels of hydroxylated fatty acids accumulating in a 
large number of independent transgenic lines were used to estimate the relative strength of 
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each promoter. As shown in Figure 6. the two populations of transgenic plants accumulated 
levels of hydroxylated fatty acids, ranging from 0.2% to about 11-12% of total fatty acids, 
with the levels being on average slightly higher in FAE1-FAH12 lines. Similarly, the best 
FAE1-FAH12 plant accumulated just over 12% of hydroxylated fatty acids (w/w of total FAs), 
5 whereas the best napin-FAH12 plant produced 10.8% of hydroxylated fatty acids (w/w of total 
FAs). These results indicate that the FAE1 promoter is highly active in transgenic Arabidopsis 
and that its in vivo activity may be superior to napin in Arabidopsis seeds. 

Sequence elements or motifs that confer both tissue specificity and developmental 
regulation of transcription reside within 393 bp of the AUG translation initiation codon in the 

10 A.t. FAE1 gene. The seed-specific expression conferred by the transcriptional regulatory 
regions of the invention is independent of the native terminator of the FAE1 gene 3' end. For 
example, in the exemplified constructs disclosed herein, a terminator derived from the 
Agrobacterium nopaline synthase gene was used. 

Lunaria annua and Brassica napus FAE1 5* regulatory regions 

15 Two sequences originating from B. napus and L. annua were isolated and 

characterized to demonstrate that regulatory regions conferring seed-specific transcription 
early in embryo development can also be found upstream of other plant FAE1 genes. 
Sequences were cloned using the technique of polymerase chain reaction (PCR) walking on 
uncloned plant genomic DNA (Devic et al., 1997). Approximately 5 (ig of genomic DNA 

20 from 1 g of fresh tissue was used for the construction of 5 different libraries by digesting DNA 
with a series of enzymes that produce blunt end fragments to which special adaptors are 
ligated. The adaptor molecules consist of a long upper strand, which contains successive 
sequences common to the adaptor primers, API and AP2. annealed at its 3' end to a shorter 
strand lacking the API sequence. However, this short strand posseses an amine group at its 3' 

25 end to prevent filling in by the DNA poiymerses during the first PCR amplification step and 
generation of the API binding site. This suppression PCR effect prevents exponential 
amplification of molecules containing the adaptor at each end, and the adaptor primer binding 
sites are only produced when a strand complementary to the upper strand of the adaptor is 
synthesized by extension from a gene specific primer. The first PCR reaction is performed 

30 using an adaptor primer API and a gene specific primer. An aliquot of the first PCR product is 
used a template in a second PCR amplification using the nested gene specific primer and AP2. 

In order to isolate the regulatory regions upstream of the B, napus FAE1 coding 
sequence, genomic DNA was prepared from developing leaves and digested with 5 blunt-end 
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cutting restriction enzymes (Dral EcoRV Hpal PvuII and Seal) to generate a series of DNA 
libraries. After ligation of adapter molecules, individual libraries were used as templates in a 
two step PCR, In the first PCR amplification using the API primer 5*- 
GGATCCTAATACGACTCACTATAGGGC-3 ' and the FAE1 gene specific primer 5'- 
5 AAAG AGTGG AGCGATGGTTATGAGG-3 ' (Bnwalkl), multiple DNA fragments were 
amplified from all five library templates. After a second round of PCR, using the AP2 primer 
5'-CTATAGGGCTCGAGCGGC-3' and the nested FAE1 specific primer 5'- 
CGGAAAGAAGC AAAGGTTG AAAAGG-3 1 (Bnwalk2), the longest single fragment of 1.6 
kb was obtained from the Hpal library template. This fragment was inserted into the pCR2.1 

10 plasmid (Invitrogen) and sequenced. The sequence is shown in Figure 2. 

For the PCR walking experiment to isolate the L annua 5* regulatory region, in 
adition to the standard API and AP2 primers, the following FAE1 specific primers were used: 
5 ' -G ATCGTTTGTGGTAAG ACG AG AGC-3 ' (Lawalkl ) and 5'- 

GTCAGTGGG AAGAAACAGAGGTTG-3 ' (Lawalk2). In the first PCR reaction, the Dral 

15 EcoRV, Pvull Seal and Sspl library templates were used. In a second PCR amplification the 
longest single fragment 1.1 kb in length was synthesized using the EcoRV library template. 
This fragment was inserted into the Hindi site of the pT7T3-18U vector (Promega), 
sequenced on both strands and analyzed (Figure 3). 

Using the sequence data obtained for the 5' regulatory regions generated by PCR 

20 walking, specific primers were generated for the amplification of the L. annua and B. napus 
FAE1 promoter fragments. For the PCR-amplification of B. napus promoter fragment the 
upstream primer was 5 ' -CTG ACTTC ACC AAAGAAAC AACTCG-3 * (BnproFW) in 
combination with the downstream primer 5 ' -CGGAATTCCGTTTTTTTTTTTAGGCG-3 * 
(BnproRV). The synthesized fragment was ligated into the Smal site of pGEM-7Zf 

25 (Promega), then excised with Xbal/BamHI and cloned into the equivalent sites of the pBHOl 
binary vector (Clontech). L. annua 5' regulatory region was amplified using the 5"- 
CAGCTTAACCGGTAAAATTGGCC-3 ' (LaproFW) upstream primer together with the 5 ? - 
TGTTC AGTTTTGTGTCGGAGAGG-3 ' (LaproRV) downstream primer and inserted in the 
Hindi site of pT7T3-18U (Promega) plasmid. In order to clone the L annua promoter 

30 fragment into the pBHOl binary vector, an Xbal site was added by subcloning the Pstl/Kpnl 
fragment released from the pT7T3-18U vector into pBIuescript II KS+ (Stratagene). The 
fragment was then excised and cloned in ihzXbal site of the pBHOl vector. 
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The resulting vectors pBnFAEl-GUS and pLaFAEl-GUS in pBHOl were then 
introduced into A. tumefaciens strain GV3101 by heat-shock, and used to transform 
Arabidopsis as described above. Transformants were selected on agar-solidified medium 
containing kanamycin (50 ng/ml). More than 100 transformants were generated for each 
construct. The activity of the L annua and 5. napus FAE1 promoters was determined by GUS 
expression assays on the developing seeds and also on non-reproductive plant tissues as 
controls. Consistent seed-specific GUS expression was obtained for both promoter constructs 
in independent transgenic lines. In contrast, there was no detectable GUS activity in leaf stem 
and silique samples. 
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WHAT IS CLAIMED IS: 

1. A recombinant nucleic acid molecule comprising a heterologous promoter sequence 
operably linked to a nucleic acid sequence, wherein the promoter sequence comprises 
a transcriptional regulatory region capable of mediating seed-specific expression in 
Arabidopsis wherein the transcriptional regulatory region: 

(a) is obtainable from a 5* region of a plant FAEJ gene; or 

(b) hybridizes under stringent conditions to the 5' region of the plant FAE1 gene; 
or 

(c) is at least 70% identical when optimally aligned to the 5' region of the plant 
FAE1 gene. 



2. The recombinant nucleic acid of claim 1 wherein the 5' region of the plant FAE1 gene 
comprises (5' to V)\ 

AGA TCTAAGAACA CACATTCCCT CAAATTTTAA TGCACATGTA 
ATCATAGTTT AGCACAATTC AAAAATAATG TAGTATTAAA GACAGAAATT 
TGTAGACTTT TTTTTGGCGT TAAAGGAAGA CTAAGTTTAT ACGTACATTT 
TATTTTAAGT GGAAAACCGA AATTTTCCAT CGAAATATAT GAATTTAGTA 
TATATATTTC TGCAATGTAC TATTTTGCTA TTTTGGCAAC TTTCAGTGGA 
CTACTACTTT ATTACAATGT GTATGGATGC ATGAGTTTGA GTATACACAT 
GTCTAAATGC ATGCTTTGCA AAACGTAACG GACCACAAAA GAGGATCCAT 
GCAAATACAT CTCATAGCTT CCTCCATTAT TTTCCGACAC AAACAGAGCA. 



3. The recombinant nucleic acid of claim 1 wherein the 5 1 region of the plant FAEJ gene 
comprises (5' to 3'): 

AAGGCTTACC CTATTAGTTG AAAGTTGAAA CTTTGTTCCC TACTCAATTC 
CTAGTTGTGT AAATGTATGT ATATGTAATG CGTATAAAAC GTAGTACTTA 
AATGACTAGG AGTGGTTCTT GAGACCGATG AGAGATGGGA GCAGAACTAA 
AG AT GAT G AC ATAATTAAGA ACGAATTTGA AAGGCTCTTA GGTTTGAATC 
CTATTCGAGA ATGTTTTTGT CAAAGATAGT GGCGATTTTG AACCAAAGAA 
AACATTTAAA AAATCAGTAT CCGGTTACGT TCATGCAAAT AGAAAGTGGT 
CTAGGATCTG ATTGTAATTT TAGACTTAAA GAGTCTCTTA AGATTCAATC 
CTGGCTGTGT ACAAAACTAC AAATAATATA TTTTAGACTA TTTGGCCTTA 
ACTAAACTTC CACTCATTAT TTACTGAGGT TAGAGAATAG ACTTGCGAAT 
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AAACACATTC CCGAGAAATA CTCATGATCC CATAATTAGT CAGAGGGTAT 
GCCAATCAGA TCTAAGAACA CACATTCCCT CAAATTTTAA TGCACATGTA 
ATCATAGTTT AGCACAATTC AAAAATAATG TAGTATTAAA GACAGAAATT 
TGTAGACTTT TTTTTGGCGT TAAAGGAAGA CTAAGTTTAT ACGTACATTT 
TATTTTAAGT GGAAAACCGA AATTTTCCAT CGAAATATAT GAATTTAGTA 
TATATATTTC TGCAATGTAC TATTTTGCTA TTTTGGCAAC TTTCAGTGGA 
CTACTACTTT ATTACAATGT GTATGGATGC ATGAGTTTGA GTATACACAT 
GTCTAAATGC ATGCTTTGCA AAACGTAACG GACCACAAAA GAGGATCCAT 
GCAAATACAT CTCATAGCTT CCTCCATTAT TTTCCGACAC AAACAGAGCA. 

The recombinant nucleic acid of claim 1 wherein the 5* region of the plant FAE1 gene 
comprises (5' to 3'): 

CTGACTTC ACCAAAGAAA CAACTCGAGT CGTTATCCAT 
CTCCTCATAA CCATCGCTCC ACTCTTTGCC TTCACCGTTT TCGGTTCGGT 
TCTCTACATC GCAACCCGGC CCAAACCGGT TTACCTCGTT GAGTACTCAT 
GCTACCTTCC ACCAACGCAT TGTAGATCAA GTATCTCCAA GGTCATGGAT 
ATCTTTTATC AAGTAAGAAA AGCTGATCCT TCTCGGAACG GCACGTGCGA 
TGACTCGTCG TGGCTTGACT TCTTGAGGAA GATTCAAGAA CGTTCAGGTC 
TAGGCGATGA AACTCACGGG CCCGAGGGGC TGCTTCAGGT CCCTCCCCGG 
AAGACTTTTG CGGCGGCGCG TGAAGAGACG GAGCAAGTTA TCATTGGTGC 
GCTAGAAAAT CTATTCAAGA ACACCAACGT TAACCCTAAA GATATAGGTA 
TACTTGTGGT GAACTCAAGC ATGTTTAATC CAACTCCATC GCTCTCCGCG 
ATGGTCGTTA ACACTTTCAA GCTCCGAAGC AACGTAAGAA GCTTTAACCT 
TGGTGGCATG GGTTGTAGTG CCGGCGTTAT AGCCATTGAT CTAGCAAAGG 
ACTTGTTGCA TGTCCATAAA AATACGTATG CTCTTGTGGT GAGCACAGAG 
AACATCACTT ATAACATTTA CGCTGGTGAT AATAGGTCCA TGATGGTTTC 
AAATTGCTTG TTCCGTGTTG GTGGGGCCGC TATTTTGCTC TCCAACAAGC 
CTGGAGATCG TAGACGGTCC AAGTACGAGC TAGTTCACAC GGTTCGAACG 
CATACCGGAG CTGACGACAA GTCTTTTCGT TGCGTGCAAC AAGGAGACGA 
TGAGAACGGC AAAATCGGAG TGAGTTTGTC CAAGGACATA ACCGATGTTG 
CTGGTCGAAC GGTTAAGAAA AACATAGCAA CGTTGGGTCC GTTGATTCTT 
CCGTTAAGCG AGAAACTTCT TTTTTTCGTT ACCTTCATGG GCAAGAAACT 
TTTCAAAGAT AAAATCAAAC ATTACTACGT CCCGGATTTC AAACTTGCTA 
TTGACCATTT TTGTATACAT GCCGGAGGCA GAGCCGTGAT TGATGTGCTA 
GAGAAGAACC TAGCCCTAGC ACCGATCGAT GTAGAGGCAT CAAGATCAAC 
GTTACATAGA TTTGGAAACA CTTCATCTAG CTCAATATGG TATGAGTTGG 
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CATACATAGA AGCAAAAGGA AGGATGAAGA AAGGTAATAA AGTTTGGCAG 
ATTGCTTTAG GGTCAGGCTT TAAGTGTAAC AGTGCAGTTT GGGTGGCTCT 
AAACAATGTC AAAGCTTCGA CAAATAGTCC TTGGGAACAC TGCATCGACA 
GATACCCGGT CAAAATTGAT TCTGATTCAG GTAAGTCAGA GACTCGTGTC 
CAAAACGGTC GGTCCTAATA AACGATGTTT GCTCTCTTTC GTTTCTTTTT 
ATTTGTTATA ATAATTTGAT GGCTACGATG TTTCTCTTGT TTGTTATGAA 
TAAAGAATGC AATGGTGTTC TAGTATTTGA TTGTTTTACA TGTATGTATC 
TCTTATTTAC ATGAAATTTT TAAACGCCTA AAAAAAAAAA CGGAATTCCG. 

The recombinant nucleic acid of claim 1 wherein the 5' region of the plant FAE1 gene 
comprises (5* to 3'): 

CAGCTTAAC CGGTAAAATT 
GGCCTGTACA TATATTTACC ACTGAGTAAA GACATCAGTT AATGATTTGT 
TGTTACTCAA TTGGGCTAAG TGTATTATTA TATGTGTTGT ATATAATAAA 
GGTAGAACGT AAATTTACTA AGAATGTGTT TTTCCAATGT GATTGCTCTT 
TGGCCTCTTA GGTTTGAATC CTACTCGAGA AGACTAATTT TAATTTACTG 
GCAAAAATAG AAATCAATTT ATAAGTGTTT AAACAAATCG ATGGTATAAC 
TGATTAGTGA TCACTCTTAG GTTTTGATCC AACTCGAGTA TTGAGTATTG 
AACGCTTTTT TTAAATAAAA TCTTGATTTT TAAATTGGTT TTTTGAGTAA 
AAAAGTTCTT AATATTTTCT CTTTGTTTTA ATGGGTTTGT TTTGCATTTT 
ATAAGCTTAA TTTTTCTAAT TTAATATTTT ATCTATCATC GTCCGTAAAG 
TTTTATTTGG CACAAACTTG TTTTACTTTT CTACCTTATA ATTTGGGAAC 
TGGTTGAGTC AAAGCGTACC GGACAAATAT GTTTTATATT CTTATTTAAG 
AATTAACACT CATCTCATAA TTAGTCAGAG GCTAGGGAGA TTCAGCCAAT 
CAATGCTAAC AACAAAATTC TCTTAATGAT CTAACGATGC TATTTAATAT 
TCGGATCAGT ATTCTTAAAT AAGAATATAA AACTAATTCA ATAGTTACAG 
ATAAAAACTT ATATAGACTT TTTTATTTGG AATATAAAAG TATCAATATA 
TTATAGACAA TATTTATAAC GTTAAAAATA CAATATTTAT ATTTTTTATA 
TATTTATTTC AAATTGAAAA GC AT TACT TC TATCGAAATG AATTTTAGTA 
TATTAATTAA TATTTTTTTA ATCGGACTAC TTTCCTATTT TGGCACCTTT 
CATCTGACTA CTAATTTATT TCAATGTGTA TGCATGCATG AG CAT GAG TA 
ATACACATGT CTATATAAAT GCATGTAAAA CGTAACGGAC CACAAAAGTG 
GATCCATACA AATACATCTC ATCGCACCCT CTCCGACACA AAACTGAACA. 
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5 7. 



8. 

10 



15 



20 

9. 



25 



30 



The recombinant nucleic acid of claim 1 wherein the promoter sequence is selected 
from the group consisting of Arabidopsis thaliana, Lunaria annua and Brassica napus 
FAE1 promoter sequences. 

The recombinant nucleic acid of any one of claims 1 through 6, wherein the 
transcriptional regulatory region is at least 70% identical when optimally aligned to 
the 5' region of the plant FAE1 gene.. 



The recombinant nucleic acid of claim 1 wherein the transcriptional regulatory region 
comprises (5' to 3'): 



AGA 


TCTAAGAACA 


CACATTCCCT 


CAAATTTTAA 


TGCACATGTA 


ATCATAGTTT 


AGCACAATTC 


AAAAATAATG 


TAGTATTAAA 


GACAGAAATT 


TGTAGACTTT 


TTTTTGGCGT 


TAAAGGAAGA 


CTAAGTTTAT 


ACGTACATTT 


TATTTTAAGT 


GGAAAACCGA 


AATTTTCCAT 


CGAAATATAT 


GAATTTAGTA 


TATATATTTC 


TGCAATGTAC 


TATTTTGCTA 


TTTTGGCAAC 


TTTCAGTGGA 


CTACTACTTT 


ATTACAATGT 


GTATGGATGC 


ATGAGTTTGA 


GTATACACAT 


GTCTAAATGC 


ATGCTTTGCA 


AAACGTAACG 


GACCACAAAA 


GAGGATCCAT 


GCAAATACAT 


CTCATAGCTT CCTCCATTAT TTTCCGACAC AAACAGAGCA. 



The recombinant nucleic acid of claim 1 wherein the transcriptional regulatory region 
comprises (5 1 to 3'): 

AAGGCTTACC CTATTAGTTG AAAGTTGAAA CTTTG7TCCC TACTCAATTC 
CTAGTTGTGT AAATGTATGT ATATGTAATG CGTATAAAAC GTAGTACTTA 
AATGACTAGG AGTGGTTCTT GAGACCGATG AGAGATGGGA GCAGAACTAA 
AGATGATGAC ATAATTAAGA ACGAATTTGA AAGGCTCTTA GGTTTGAATC 
CTATTCGAGA ATGTTTTTGT CAAAGATAGT GGCGATTTTG AACCAAAGAA 
AACATTTAAA AAATCAGTAT CCGGTTACGT TCATGCAAAT AGAAAGTGGT 
CTAGGATCTG ATTGTAATTT TAGACTTAAA GAGTCTCTTA AGATTCAATC 
CTGGCTGTGT ACAAAACTAC AAATAATATA TTTTAGACTA TTTGGCCTTA 
ACTAAACTTC CACTCATTAT TTACTGAGGT TAGAGAATAG ACTTGCGAAT 
AAACACATTC CCGAGAAATA CTCATGATCC CATAATTAGT CAGAGGGTAT 
GCCAATCAGA TCTAAGAACA CACATTCCCT CAAATTTTAA TGCACATGTA 
ATCATAGTTT AGCACAATTC AAAAATAATG TAGTATTAAA GACAGAAATT 
TGTAGACTTT TTTTTGGCGT TAAAGGAAGA CTAAGTTTAT ACGTACATTT 
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TATTTTAAGT GGAAAACCGA AATTTTCCAT CGAAATATAT GAATTTAGTA 
TATATATTTC TGCAATGTAC TATTTTGCTA TTTTGGCAAC TTTCAGTGGA 
CTACTACTTT ATTACAATGT GTATGGATGC ATGAGTTTGA GTATACACAT 
GTCTAAATGC ATGCTTTGCA AAACGTAACG GACCACAAAA GAGGATCCAT 
GCAAATACAT CTCATAGCTT CCTCCATTAT TTTCCGACAC AAACAGAGCA. 

The recombinant nucleic acid of claim I wherein the transcriptional regulatory region 
comprises (5' to 3'): 

CTGACTTC ACCAAAGAAA CAACTCGAGT CGTTATCCAT 
CTCCTCATAA CCATCGCTCC ACTCTTTGCC TTCACCGTTT TCGGTTCGGT 
TCTCTACATC GCAACCCGGC CCAAACCGGT TTACCTCGTT GAGTACTCAT 
GCTACCTTCC ACCAACGCAT TGTAGATCAA GTATCTCCAA GGTCATGGAT 
ATCTTTTATC AAGTAAGAAA AGCTGATCCT TCTCGGAACG GCACGTGCGA 
TGACTCGTCG TGGCTTGACT TCTTGAGGAA GATTCAAGAA CGTTCAGGTC 
TAGGCGATGA AACTCACGGG CCCGAGGGGC TGCTTCAGGT CCCTCCCCGG 
AAGACTTTTG CGGCGGCGCG TGAAGAGACG GAGCAAGTTA TCATTGGTGC 
GCTAGAAAAT CTATTCAAGA ACACCAACGT TAACCCTAAA GATATAGGTA 
TACTTGTGGT GAACTCAAGC ATGTTTAATC CAACTCCATC GCTCTCCGCG 
ATGGTCGTTA ACACTTTCAA GCTCCGAAGC AACGTAAGAA GCTTTAACCT 
TGGTGGCATG GGTTGTAGTG CCGGCGTTAT AGCCATTGAT CTAGCAAAGG 
ACTTGTTGCA TGTCCATAAA AATACGTATG CTCTTGTGGT GAGCACAGAG 
AACATCACTT ATAACATTTA CGCTGGTGAT AATAGGTCCA TGATGGTTTC 
AAATTGCTTG TTCCGTGTTG GTGGGGCCGC TATTTTGCTC TCCAACAAGC 
CTGGAGATCG TAGACGGTCC AAGTACGAGC TAGTTCACAC GGTTCGAACG 
CATACCGGAG CTGACGACAA GTCTTTTCGT TGCGTGCAAC AAGGAGACGA 
TGAGAACGGC AAAAT CGGAG TGAGTTTGTC CAAGGACATA ACCGATGTTG 
CTGGTCGAAC GGTTAAGAAA AACATAGCAA CGTTGGGTCC GTTGATTCTT 
CCGTTAAGCG AGAAACTTCT TTTTTTCGTT ACCTTCATGG GCAAGAAACT 
TTTCAAAGAT AAAATCAAAC ATTACTACGT CCCGGATTTC AAACTTGCTA 
TTGACCATTT TTGTATACAT GCCGGAGGCA GAGCCGTGAT TGATGTGCTA 
GAGAAGAACC TAGCCCTAGC ACCGATCGAT GTAGAGGCAT CAAGATCAAC 
GTTACATAGA TTTGGAAACA CTTCATCTAG CTCAATATGG TATGAGTTGG 
CATACATAGA AGCAAAAGGA AGGATGAAGA AAGGTAATAA AGTTTGGCAG 
ATTGCTTTAG GGTCAGGCTT TAAGTGTAAC AGTGCAGTTT GGGTGGCTCT 
AAACAATGTC AAAGCTTCGA CAAATAGTCC TTGGGAACAC TGCATCGACA 
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GATACCCGGT CAAAATTGAT TCTGATTCAG GTAAGTCAGA GACTCGTGTC 

CAAAACGGTC GGTCCTAATA AACGATGTTT GCTCTCTTTC GTTTCTTTTT 

ATTTGTTATA ATAATTTGAT GGCTACGATG TTTCTCTTGT TTGTTATGAA 

TAAAGAATGC AATGGTGTTC TAGTATTTGA TTGTTTTACA TGTATGTATC 

TCTTATTTAC ATGAAATTTT TAAACGCCTA AAAAAAAAAA CGGAATTCCG. 

The recombinant nucleic acid of claim 1 wherein the transcriptional regulatory region 
comprises (5 1 to 3 f ): 









CAGCTTAAC 


CGGTAAAATT 


GGCCTGTACA 


TATATTTACC 


ACTGAGTAAA 


GACATCAGTT 


AATGATTTGT 


TGTTACTCAA 


TTGGGCTAAG 


TGTATTATTA 


TATGTGTTGT 


ATATAATAAA 


GGTAGAACGT 


AAATTTACTA 


AGAATGTGTT 


TTTCCAATGT 


GATTGCTCTT 


TGGCCTCTTA 


GGTTTGAATC 


CTACTCGAGA 


AGACTAATTT 


TAATTTACTG 


GCAAAAATAG 


AAATCAATTT 


ATAAGTGTTT 


AAACAAATCG 


ATGGTATAAC 


TGATTAGTGA 


TCACTCTTAG 


GTTTTGATCC 


AACTCGAGTA 


TTGAGTATTG 


AACGCTTTTT 


TTAAATAAAA 


TCTTGATTTT 


TAAATTGGTT 


TTTTGAGTAA 


AAAAGTTCTT 


AATATTTTCT 


CTTTGTTTTA 


ATGGGTTTGT 


TTTGCATTTT 


ATAAGCTTAA 


TTTTTCTAAT 


TTAATATTTT 


ATCTATCATC 


GTCCGTAAAG 


TTTTATTTGG 


CACAAACTTG 


TTTTACTTTT 


CTACCTTATA 


ATTTGGGAAC 


TGGTTGAGTC 


AAAGCGTACC 


GGACAAATAT 


GTTTTATATT 


CTTATTTAAG 


AATTAACACT 


CATCTCATAA 


TTAGTCAGAG 


GCTAGGGAGA 


TTCAGCCAAT 


CAATGCTAAC 


AACAAAATTC 


TCTTAATGAT 


CTAACGATGC 


TATTTAATAT 


TCGGATCAGT 


ATTCTTAAAT 


AAGAATATAA 


AACTAATTCA 


ATAGTTACAG 


ATAAAAACTT 


AT AT AG AC TT 


TTTTATTTGG 


AATATAAAAG 


TATCAATATA 


TTATAGACAA 


TATTTATAAC 


GTTAAAAATA 


CAATATTTAT 


ATTTTTTATA 


TATTTATTTC 


AAATTGAAAA 


GCATTACTTC 


TATCGAAATG 


AATTTTAGTA 


TATTAATTAA 


TATTTTTTTA 


ATCGGACTAC 


TTTCCTATTT 


TGGCACCTTT 


CATCTGACTA 


CTAATTTATT 


TCAATGTGTA 


TGCATGCATG 


AGCATGAGTA 


ATACACATGT 


CTATATAAAT 


GCATGTAAAA 


CGTAACGGAC 


CACAAAAGTG 


GATCCATACA 


AATACATCTC 


ATCGCACCCT 


CTCCGACACA 


AAACTGAACA . 



The recombinant nucleic acid of any one of claims 1 through 1 1 wherein the nucleic 
acid sequence encodes a translatable mRNA. 
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13. The recombinant nucleic acid of claim 12 wherein the nucleic acid sequence encodes 
an enzyme involved in lipid metabolism. 

14. The recombinant nucleic acid of any one of claims 1 through 13, further comprising a 
transcription termination region operably linked to the nucleic acid sequence. 

15. A host cell comprising the recombinant nucleic acid of any one of claims 1 through 14. 

1 6. The host cell of claim 1 5, wherein the host cell is of a dicotyledonous plant species. 

17. A plant comprising the recombinant nucleic acid of any one of claims 1 through 14. 

18. The plant of claim 1 7, wherein the plant is of a dicotyledonous plant species. 

1 9. A method of altering the phenotype of a seed comprising: 

a) transforming a seed-bearing plant, or a progenitor of the seed-bearing plant 
with a vector comprising the nucleic acid of any one of claims 1 through 14; 

b) growing the seed-bearing plant to obtain seed under conditions wherein the 
nucleic acid sequence is expressed during embryogenesis under the control of 
the transcriptional regulatory region to alter the phenotype of the seed. 

20. A method of transforming a plant cell comprising transforming the plant cell with the 
recombinant nucleic acid of any one of claims 1 through 14. 
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Figure 1. Arabidopsis thaliana FAE1 promoter: 
(Length: 934 bp) 

-950 ACTCA TAAAA ACTAG TAGATTGGTT GGTTGGTTTC CA TGTACCAG 

AtproFW 

-900 AAGGCTTACC CTATTAGTTG AAAGTTGAAA CTTTGTTCCC TACTCAATTC 

-850 CTAGTTGTGT AAATGTATGT ATATGTAATG CGTATAAAAC GTAGTACTTA 

-800 AATGACTAGG AGTGGTTCTT GAGACCGATG AGAGATGGGA GCAGAACTAA 

-750 AGATGATGAC ATAATTAAGA ACGAATTTGA AAGGCTCTTA GGTTTGAATC 

-700 CTATTCGAGA ATGTTTTTGT CAAAGATAGT GGCGATTTTG AACCAAAGAA 

-650 AACATTTAAA AAATCAGTAT CCGGTTACGT TCATGCAAAT AGAAAGTGGT 

-600 CTAGGATCTG ATTGTAATTT TAGACTTAAA GAGTCTCTTA AGATTCAATC 

-550 CTGGCTGTGT ACAAAACTAC AAATAATATA TTTTAGACTA TTTGGCCTTA 

-500 ACTAAACTTC CACTCATTAT TTACTGAGGT TAGAGAATAG ACTTGCGAAT 

-450 AAACACATTC CCGAGAAATA CT CAT GAT CC CATAATTAGT CAGAGGGTAT 

-400 GCCAATCAGA TCTAAGAACA CACATTCCCT CAAATTTTAA TGCACATGTA 

-350 ATCATAGTTT AGCACAATTC AAAAATAATG TAGTATTAAA GACAGAAATT 

-300 TGTAGACTTT TTTTTGGCGT TAAAGGAAGA CTAAGTTTAT ACGTACATTT 

-250 TATTTTAAGT GGAAAACCGA AATTTTCCAT CGAAATATAT GAATTTAGTA 

-200 TATATATTTC TGCAATGTAC TATTTTGCTA TTTTGGCAAC TTTCAGTGGA 

-150 CTACTACTTT ATTACAATGT GTATGGATGC ATGAGTTTGA GTATACACAT 

-100 GTCTAAATGC ATGCTTTGCA AAACGTAACG GACCACAAAA GAGGATCCAT 

-50 GCAAATACAT CTCATAGCTT CCT CCATTAT TTTCCGACAC AAACAGAGCA 

AtproRV 

1 [ ATG ACGTCCG TTAACGTTAA GCTCCTT 
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Figure 2. Brassica napus FAE1 promoter: 
(Length: 1588 bp) 

-1600 GGTTGGGCAA AT CTGACTTC ACCAAAGAAA CAACTCGA GT CGTTATCCAT 

BnproFW 

-1550 CTCCTCATAA CCATCGCTCC ACTCTTTGCC TTCACCGTTT TCGGTTCGGT 
-1500 TCTCTACATC GCAACCCGGC CCAAACCGGT TTACCTCGTT GAGTACTCAT 
-1450 GCTACCTTCC ACCAACGCAT TGTAGATCAA GTATCTCCAA GGTCATGGAT 
-1400 ATCTTTTATC AAGTAAGAAA AGCTGATCCT TCTCGGAACG GCACGTGCGA 
-1350 TGACTCGTCG TGGCTTGACT TCTTGAGGAA GATTCAAGAA CGTTCAGGTC 
-1300 TAGGCGATGA AACTCACGGG CCCGAGGGGC TGCTTCAGGT CCCTCCCCGG 
-1250 AAGACTTTTG CGGCGGCGCG TGAAGAGACG GAGCAAGTTA TCATTGGTGC 
-1200 GCTAGAAAAT CTATTCAAGA ACACCAACGT TAACCCTAAA GATATAGGTA 
-1150 TACTTGTGGT GAACTCAAGC ATGTTTAATC CAACTCCATC GCTCTCCGCG 
-1100 ATGGTCGTTA ACACTTTCAA GCTCCGAAGC AACGTAAGAA GCTTTAACCT 
-1050 TGGTGGCATG GGTTGTAGTG CCGGCGTTAT AGCCATTGAT CTAGCAAAGG 
-1000 ACTTGTTGCA TGTCCATAAA AATACGTATG CTCTTGTGGT GAGCACAGAG 
-950 AACATCACTT ATAACATTTA CGCTGGTGAT AATAGGTCCA TGATGGTTTC 
-900 AAATTGCTTG TTCCGTGTTG GTGGGGCCGC TATTTTGCTC TCCAACAAGC 
-850 CTGGAGATCG TAGACGGTCC AAGTACGAGC TAGTTCACAC GGTTCGAACG 
-800 CATACCGGAG CTGACGACAA GTCTTTTCGT TGCGTGCAAC AAGGAGACGA 
-750 TGAGAACGGC AAAATCGGAG TGAGTTTGTC CAAGGACATA ACCGATGTTG 
-700 CTGGTCGAAC GGTTAAGAAA AACATAGCAA CGTTGGGTCC GTTGATTCTT 
-650 CCGTTAAGCG AGAAACTTCT TTTTTTCGTT ACCTTCATGG GCAAGAAACT 
-600 TTTCAAAGAT AAAATCAAAC ATTACTACGT CCCGGATTTC AAACTTGCTA 
-550 TTGACCATTT TTGTATACAT GCCGGAGGCA GAGCCGTGAT TGATGTGCTA 
-500 GAGAAGAACC TAGCCCTAGC ACCGATCGAT GTAGAGGCAT CAAGATCAAC 
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Figure 2 Continued Brassica napus FAE1 promoter: 

-450 GTTACATAGA TTTGGAAACA CTTCATCTAG CTCAATATGG TATGAGTTGG 

-400 CATACATAGA AGCAAAAGGA AGGATGAAGA AAGGTAATAA AGTTTGGCAG 

-350 ATTGCTTTAG GGTCAGGCTT TAAGTGTAAC AGTGCAGTTT GGGTGGCTCT 

-300 AAACAATGTC AAAGCTTCGA CAAATAGTCC TTGGGAACAC TGCATCGACA 

-250 GATACCCGGT CAAAATTGAT TCTGATTCAG GTAAGTCAGA GACTCGTGTC 

-200 CAAAACGGTC GGTCCTAATA AACGATGTTT GCTCTCTTTC GTTTCTTTTT 

-150 ATTTGTTATA ATAATTTGAT GGCTACGATG TTTCTCTTGT TTGTTATGAA 

-100 TAAAGAATGC AATGGTGTTC TAGTATTTGA TTGTTTTACA TGTATGTATC 

-50 TCTTATTTAC ATGAAATTTT TAAAC GCCTA AAAAAAAAAA CGGAATTCCG 

BnproRV 

1 [ATGj ACGTCCA TTAACGTAAA GCTCCTTTAC CATTACGTCA TAACCA ACCT 

51 TTTCAACCTT TGCTTCTTTC CG TTAACGGC GATCGTCGCC GGAAAAGCCT 
Bnwalk2 

101 ATCGGCTTAC CATAGACGAT CTTCACCACT TATACTATTC CTATCTCCAA 

151 CACA ACCTCA TAACCATCGC TCCACTCTTT GCCTTCACCG 
<- Bnwalkl 
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Figure 3. 



Lunaria annua FAE1 promoter: 
(Length: 1069 bp) 



-1100 



CG CCGGGGAGTT TCAGCTTAAC CGGTAAAATT 



LaproFW ^ 



-1050 GGCCTGTACA TATATTTACC ACTGAGTAAA GACATCAGTT AATGATTTGT 
-1000 TGTTACTCAA TTGGGCTAAG TGTATTATTA TATGTGTTGT ATATAATAAA 
-950 GGTAGAACGT AAATTTACTA AGAATGTGTT TTTCCAATGT GATTGCTCTT 
-900 TGGCCTCTTA GGTTTGAATC CTACTCGAGA AGACTAATTT TAATTTACTG 
-850 GCAAAAATAG AAATCAATTT ATAAGTGTTT AAACAAATCG ATGGTATAAC 
-800 TGATTAGTGA TCACTCTTAG GTTTTGATCC AACTCGAGTA TTGAGTATTG 
-750 AACGCTTTTT TTAAATAAAA TCTTGATTTT TAAATTGGTT TTTTGAGTAA 
-700 AAAAGTTCTT AATATTTTCT CTTTGTTTTA ATGGGTTTGT TTTGCATTTT 
-650 ATAAGCTTAA TTTTTCTAAT TTAATATTTT ATCTATCATC GTCCGTAAAG 
-600 TTTTATTTGG CACAAACTTG TTTTACTTTT CTACCTTATA ATTTGGGAAC 
-550 TGGTTGAGTC AAAGCGTACC GGACAAATAT GTTTTATATT CTTATTTAAG 
-500 AATTAACACT CATCTCATAA TTAGTCAGAG GCTAGGGAGA TTCAGCCAAT 
-450 CAATGCTAAC AACAAAATTC TCTTAATGAT CTAACGATGC TATTTAATAT 
-400 TCGGATCAGT ATTCTTAAAT AAGAATATAA AACTAATTCA ATAGTTACAG 
-350 ATAAAAACTT ATATAGACTT TTTTATTTGG AATATAAAAG TATCAATATA 
-300 TTATAGACAA TATTTATAAC GTTAAAAATA CAATATTTAT ATTTTTTATA 
-250 TATTTATTTC AAATTGAAAA GCATTACTTC TATCGAAATG AATTTTAGTA 
-200 TATTAATTAA TATTTTTTTA ATCGGACTAC TTTCCTATTT TGGCACCTTT 
-150 CATCTGACTA CTAATTTATT TCAATGTGTA TGCATGCATG AG CAT GAG T A 
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Figure 3 Continued. Lunaria annua FAE1 promoter: 

-100 ATACACATGT CTATATAAAT GCATGTAAAA CGTAACGGAC CACAAAAGTG 

-50 GATCCATACA AATACATCTC ATCGCAC CCT CTCCGACACA AAACTGAACA 
^" LaproRV 

i [atgIacgtctg TGAACGTAAA actcctttac cattacgtca taaccaactt 

51 TTT CAACCTC TGTTTCTTCC CACTGAC GGG GATCCTCGCC GGAAAAGGCT 
<T LawalJt2 
101 CTCGTCTTAC CACAAACGAT C TCCACCA 
4r Lawalkl 
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Figure 4: Alignment of A.L, L.a. and B.n. FAE1 promoters 

CLUSTAL W {1.74) multiple sequence alignment 



A. t . 

L.a . 

B. n. GG77GGGCAAA7C7GAC77CACCAAAGAAACAAC7CGAG7CG77A7CCA7C7CC7CA7AA 

A. t. 

L.a. 

B. n. CCATCGCTCCACTCTTTGCCTTCACCGTTTTCGGTTCGGTTCTCTACATCGCAACCCGGC 

A. t. 

L.a. 

B. n. CCAAACCGGTTTACCTCGTTGAGTACTCATGCTACCTTCCACCAACGCATTGTAGATCAA 

A. t. 

L.a. 

B. n. G7A7C7CCAAGG7CA7GGA7A7C7777A7CAAG7AAGAAAAGC7GA7CC77C7CGGAACG 

A. C. 

L.a. 

B. n. GCACGTGCGATGACTCGTCGTGGCTTGACTTCTTGAGGAAGA7TCAAGAACGTTCAGGTC 

A. t. 

L.a. 

B. n. TAGGCGATGAAACTCACGGGCCCGAGGGGCTGCTTCAGGTCCCTCCCCGGAAGACT7TTG 

A. t . 

L.a. 

B. n. CGGCGGCGCGTGAAGAGACGGAGCAAGTTATCATTGG7GCGCTAGAAAATCTA7TCAAGA 

A. t. 

L.a. 

B. n. ACACCAACG77AACCC7AAAGA7A7AGG7A7AC77G7GG7GAAC7CAAGCA7G777AA7C 

A. C. 

L.a. CGCCGGGGAGTTTCAGCT7AA 

B. n. CAAC7CCA7CGC7C7CCGCGA7GGTCG77AACAC777CAAGC7CCGAAGCAACG7AAGAA 

Con. 4 NNNNNNNNNNNNNNNNNNNNN 

A. t . AC7CA7AAAAAC7AG7AGA7 — TGG77GG 

L.a. CCGG7AAAA77GGCC7G7ACA7A7A777ACCAC7GAG7AAAGACA7CAG77AA7GA777G 

B. n. GC777AACC77GG7GGCA7GGG77G7AG7GCCGGCG77A7AGCCA77GA7C7AGCAAAGG 

Con . 4 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNMS KS RKW7WARM YCKYRRWYNNKSRWWKG 
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Figure 4 Continued: Alignment of At, L.cu and B.n. FAE1 promoters 



A. t. TTGGTTTCCA--TGTACCAGAAGGCTTACCCTAT-TAGTTGAAAGTTGAAACTTTGTTCC 
L.a. 77G77ACTCAA77GGGC7AAGTG7A7TA77A7A7-G7G7TG7ATA7AATAAAGG7AGAAC 

B. n. ACT7GT7GCATGTCCATAAAAA7ACG7A7GC7C77G7GG7GAGCACAGAGAACA7CAC7T 

** * * ******* * * 

Con . 4 WYKKKWYBCANNTSBRYHARRWKDMKTAYBMTMTNKWGKTGWRHRYWRWRAMBDTVDHHY 

A. C . C7AC7CAA77CCTAG7TG7G7AAA7GT ATGTATATGTAAT GCGTATAAAACGTA 

L.a. GTAA--ATTTACTAAGAATGTGTTTTTCCAATGTGATTGCTCTTTGGCCTCTTAGGTTTG 

B. n. ATAA-CATT7ACGCTGG7GA7AATAGG7CCA7GA7GGT77CAAAT7GCT7GTTCCG7GT7 

** ** ** * * * *** * * ** * * + 

Con . 4 VTAMNNAWTTMC^DKDDKRTRWWWKKNNNATGWDDDTKYHMWNNNGCBTVTWMVRYKTD 

A. t. GTACTTAAATGACTAGGAGTGGTTCTTGAGACCGATGAGAGATGGGAG-CAGAACTAAAG 
L.a. AATCC7AC7CGAGAAG-ACTAAT7T7AA7T7AC7GGCAAAAATAGAAA-TCAAT77A7AA 

B. n. GGTGGGGC-CG CTATTTTGC TCTCCAACAAGCCTGGAGATCGTAGACGGTCCAAGTACGA 

* ' " ""*" * * * * * * * 

Con . 4 RDWSBKRMNYGMBWWKNWS YDVTYYWWVWDDMCKRKVRRWVRTRGRMRNYMVAWBTAHRR 

A. t. AT— GATGACATAATTA AGAACGAATTTGA-AAGG-C7C77AGG77TGAA7CCT 

L.a. GT— GTTTAAACAAATCGATGGTATAACTGATTAGT-GATCAC7CTTAGG7TTTGATCCA 

B. n. GC7AGTTCACACGGT7CGAACGCATACCGGAGCTGACGACAAG7CT7TTCG77GCG7GCA 

* * * * ******** * * * ***★ ' * * * 

Con . 4 RYNNGW7BAMAYRRWTMNNNNNNAKAMCKRAKYWGWNRABVNSTC7TWKSKT7KVRTSCW 

A . t . AT7CGAGAATG77TT7G7CAAAGA7AG7GGCGA7T77GAACCAAAGAAAACA777AAA-A 
L.a. AC7CGAG7A7TGAGTA77GAACGCT7 7777TAAA7AAAA7C77GAT7777A-A 

B. n. A — CAAGGA-GACGA7GAGAACGGCAA AA7CGGAG7GAG777G7CCAAGGACA7A 

******* ** * * * * * * *-* 

Con . 4 ANNCRAGDANKDHKWWKWSAAMGVYWNNNNNNNOT^ 

A. t. AATCAGTA7CCGG7TAC GTTCATGCAAATAGAAAG7GG7CTA GGA7C7GATT- 

L.a. AT7GG7TT7TTGAG7AAAAAAGT7C77AA7A777TC7C777G7777AA7GGG777G7TT- 

B. n. ACCGA7G77GCTGG7CGAACGGTTAAGAAAAACATAGCAACG77 GGG7CCG77GA 

* * * ** *** * ** * ** * * * 

Con . 4 AHYSRKKW7BYKRK7MVNNNNGT7MWKRMWAWYWKMDMDWBG7YNNNNNGGR7YYGW7KN 

A. t. G7AA77TTAGA — C77AAAGAG7C7C — T7AAGA7TCAATCCTGGC7-GTG7ACAAAACT 
L.a. 7GCATT7TA7AAGCT7AA77T77C7AA777AA7A77T7A7CTA7CA7CG7CCGTAAAGT7 

B. n . TTC7TCCG7TAAGCGAGAAAC77CTT — TT7TTCGT7A — CC77CA7GGGCAAGAAAC77 

* **** * *** ♦ * * *** * * *** » 

Con . 4 KKMWT YYKWKANNCKWRAWDHKTCTHNN7TWWKMKT YWNNCYWKSMTNGKSHRBAAAVYT 

A. t. ACAAA7AATA7A T7TTAGAC7A7T7GGCCTTAACTAAAC7TCCA-C7CA77A7TTA 

L.a. TTATTTGGCACAAACTTGTTTTACTTT7C7ACCTTA—7AATT7GGGAAC7GGTTGAG7- 

B. n. 77CAAAGA7AAAA7CAAACAT7ACTACG7CCCGGATTTCAAAC77GC7A77GACCATTTT 

★ ** * *** * ***** * 

Con . 4 WYMWWWRRYAHANNNNWDYWWKACTWYKYBVCSK1^NYAAWY7KSSWNYTSRYYRWKTN 

A. t. -CTGAGGTTAGAGAA--TAGACTTGCGAATAAACACATTCCCGAGAAATACTCATGATCC 
L.a. -CAAAGCG7ACCGGA — CAAA7A7G77T-TATA7TCT7AT77AAGAAT7AACAC7CA7CT 

B. n. 7G7ATACATGCCGGAGGCAGAGCCG7GA7TGATG7GC7AGAGAAGAACC7AGCCC7AGCA 

****** * *• **** * * 

Con . 4 NSWRWRSD7RSMGRANNYARABHYGYKWNTRWWBWSHTWBHBRAGAAHYWMBMMYBAKCH 
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Figure 4 Continued: Alignment of At, La. and Bm. FAE1 promoters 

CE3 

A. t . CATAATTAGTCAGAGGGTATG CCAATCAGATCTAAGAAC fiCACATTCCCTCj 

L . a . CATAAT TAGTCAGAGGCTAGGGAGAT T CAGCCAATCAATGCTAACAACAAA- ATTCTCTT 

B. n. CCGATCGATGTAGAGGC ATCAAGATCAACGTTACATAGATTTGG 

* * ★ ***** ** * ******* ** * * * 

Con . 4 CMKAWYKAKKYAGAGGSNNNNNNNNNNNNNNNNATCARDDYYAASRWYAMANAKWYY YKB 

A. t. AA — ATTTTA — ATGCAC-ATGTAATCAT AGTTT AGC ACAAT T CAAAA 

L- <3 - AATGATCTAACGATGCT — ATTTAATATTCGGATCAGTATTCTTAAATAAGAATATAAAA 

B. n. AAACACTTCATCTAGCTCAATATGGTATG AGTTGGCATACAT-AGAAG -CAAAA 

** * * * ****** * *** * ** * * ** **** 

Con . 4 AANNAYYTHANNVWGCWNNATDTRRTMWKNNNNNNAGTWKNNNNNNAKNASAAKNYAA^ 

A . t . ATAATGTAGTA-TTAAAGACAGAAATTTGTA — GACTTTTTT TTGGCGT-TAAAGG 

£ . a • CTAATTCAATAGTTACAGATAAAAACTTATATAGACTTTTTTAT — TTGGAATATAAAAG 

B. n. GGAAGGATGAA-GAAAGGTAATAAAGTTTGGCAGATTGCTTTAGGGTCAGGCT7TAAGTG 

** * * * + *** ** *** * **** * * * *** * 

Con . 4 VKAAKKHWRWANKWAMRGWHADAAABTTDKRNNGAYTKYTTTNNNNTYRGWTNTAARDG 

A. t . AA GACTAAGTTTATA-CGT AC ATTT - T ATTTTAAGT 

L . a . TATCAATATATTATA-GACAATATTTATAACGTTAAAAATACAATATTTATATTTTTTAT 

B. n. TAACAGTGCAGTTTGGGTGGCTCTAAACAATGTCAAAGCTTCGACAAATAGTCCTTGGGA 

* ** * * * * * * ***** *** ****** ** 

Con . 4 WANNNNNNNNNNNNNNGWSDMWVTWWAYANYGTNNNNNNNNNNAYAWWTNKWYYTTDDRW 

CE1 

A. t. GGA AAACCGAAATT — [TTCCATCGAl AATATATGA — ATTT -AG TAT AT 

£•3 . ATATTTATTTCAAATTGAAAAGCATTACTTCTATCGAAATGA — ATTTTAGTATATTAAT 

B. n. ACACT GCATCGACAGATACCCGGTCAAAATTGATTCTGATTCAGGTAAGTCAGA 

* * ***** ** * ** *** *** ★ ★ 

Con . 4 RBAYTNNNNNNRMAYYGAYADDYAYYMSDTCDAWMKWDATKMNNATTYNRGTAWRTNNNN 

G-box2 

A. t . — ATATTTCTGCAAT GT ACTATTTTGCTATTTTG G jCAA-CTTl TCAGTGGACTAC 

£ • <3 • TAATATTTTTTTAATC GGACTACTTTCCTATTTTGGCAC-CTTTCATCTGACTAC 

3. n . GACTCGTGTCCAAAACGGTCGGTCCTAATAAACGATGTTTGCTCTCTTTCGTTT — CTTT 

* + * ** * * *** * * ** ** *★ ***** «*** 

Con . 4 NNMTMKTKYYBHAAWNNNNNNGKMCTAHTWWVCKATKTTKGCWMNCTTTCRKYKNNCTWY 

G-boxl 

A . t . TACTTTATTACAATGTGT — ATGGATGC-ATGAG TTTGAGTA-TAJCACATGJTCTAAA 

L . a . TAATTTATTTCAATGTGT — ATGCATGC-ATGAG CATGAG7AATACACATGTCTATA 

B. n. TTATTTGTTATAATAATTTGATGGCTACGATGTTTCTCTTGTTTGTTATGAATAAAGAAT 

* *** ** *** * * * * * * *** *# * ** •* * 

Con . 4 TWMTTTRTTWYAATRWKTNNATGSMTRCNATGWKNNNYWTGWKTRWTAYRMATRWMKAWW 

A-300 £M1 ABA 

A. t . TGCATGCT-TTGCAAAACGTAACGGACC-ACAAAAGAGGATCCA TTGCAAATACATCTCATl 
I • <3 . TAAATGCA-T-GTAAAACGTAACGGACC-ACAAAAGTGGATCCATACAAATACATCTCAT 

B. n. GCAATGGTGTTCTAGTATTTGATTGTTTTACATGTATGTATCTCTT-ATTTACATGAAAT 

*** * ***** *** * *** *' * ***** ** 

Con . 4 . KVMATGSWNTNSYARWAYKTRAYKGWYYNACAWRWRWGKATCYMTDNAWWTACATSWMAT 

A. t . AGC-TTCCTCCATTATTTTCCGACACAAA-CAGAGCA 

£ . a . CGC-ACCCTC TCCGACACAAAACTGAACA 

B. n. TTTTAAACGCC TAAAAAAAAAAACGGAATTCCG 

* * * * * *** * + * 

Con . 4 HKYNWHMCKCNNNNNNNNTMMRAMAMAAANCDGARYWNNN 
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Figure 5: Alignment of/!./, and La. FAE1 promoters 

CLUSTAL W (1.74) multiple sequence alignment 



A.e. ACTCATAA 

L . a . CGCCGGGGAGTTTCAGCTTAACCGGTAAAATTGGCCTGTACATATATTTACCACTGAGTA 

★ ** * ★ 

Con. 5 ACTSAKWA 

A. t . AAACTAGTAGAT — TGGTTGGTTGGTTTCCA — TGTACCAGAAGGCTTACCCTATTAGTT 

L . a . AAGACATCAGTTAATGATTTGTTGTTACTCAATTGGGCTAAGTGTATTATTATATGTGTT 
** ★ ** * ** ** **** ★ ** ** * ★ * *** **★ * ★ * 

Con . 5 AARMYAKYAGWTNNTGRTTKGTTGKTWYYCANNTGKRCYARRWGKMTTAYYMTATKWGTT 

A. C . GAAAGTTGAAACTTTGTTCCCTACTCAATTCCTAGTTGTGTAAATGT ATGTATATGT 

L . a . GTATATAATAAAGGTAGAACGTAA — ATTTACTAAGAATGTGTTTTTCCAATGTGATTGC 

* + * ** * * ** * ** **+ *** * * ***★ ** 

Con . 5 GWAWRTWRWAAMKKTRKWMCSTAMNNAWTTMCTARKWRTGTRWWTKTNNNATGTRWWTGY 

A . t . AAT GCGTATAAAACGTAGTACTTAAATGACTAGGAGTGGTTCTTGAGACCGATGAGA 

L . a . TCTTTGGCCTCTTAGGTTTGAATCCTACTCGAGAAG-ACTAATTTTAATTTACTGGCAAA 
* ** * * * * * ** ** ** * * ** * * ♦ * 

Con . 5 WMTNNNGCSTMTWARRYKTRRWWCYTAMWYGA3WAGNASTRRTTYTWRWKWMCKRKSARA 

A. t . GATGGGAGCAGAACTAAAGATGATGACATAATTA AGAACGAATTTGAAAGG-CT 

L.a. AATAGAAATCAATTTATAAGTGTTTAAACAAATCGATGGTATAACTGATTAGTGATCACT 
** * * * ** * ***** ** * * *** *★* * * ** 

Con . 5 RATRGRARYMRAWYTAWARRTGWTKAMAYAAWTMNNNNNNAKAACKRATTWGWRAKSNCT 

A. t. CTTAGGTTTGAATCCTATTCGAGAATGTTTTTGTCAAAGATAGTGGCGATTTTGAACCAA 
L . a . CTTAGGTTTTGATCCAACTCGAGTATTGAGTATTGAACGCTT TTTTTAAATAA 



Con . 5 CTTAGGTTTKRATCCWAYTCGAGWATKKWKTWKTSAAMGMTWNNNNNNNTTTTKAAMYAA 

A. t . AGAAAACATTTAAAAAATCAGTATCCGGTTAC GTTCATGCAAATAGAAAGTGG7CT 

L.a. AATCTTGATTTTTAAATTGGTTTTTTGAGTAAAAAAGTTCTTAATATTTTCTCTTTGTTT 

* **** *** * * * * ** **** * * * * ** * 

Con . 5 ARWMWWSATTTWWAAAWTSRKTWTYYGRKTAMNNNNGTTCWTRMWAWTWKMWMKTKGTTT 

A. t . A GGATCTGATTGTAATTTTAGA — CTTAAAGAGTCTC — TTAAGATTCAATCCTGGC 

L.a. TAATGGGTTTGTTTTGCATTTTATAAGCTTAATTTTTCTAATTTAATATTTTATCTATCA 

** * ** ** ****** * ***** *** **** *** *** 

Con . 5 WNNNGGRT YTGWTTKKMATTTTAKANNCTTAAWKWKTCTMNNTTAAKATT YWATC YWKSM 

A. t . T-GTGTACAAAACTACAAATAATATA TTTTAGACTATTTGGCCTTAACTAAAC7TC 

L.a. TCGTCCGTAAAGTTTTATTTGGCACAAACTTGTTTTACTTTTCTACCTTA — TAATTTGG 

* ** *** * * * * * * ** *** ** ***** *** 

Con . 5 TNGTS YRYAAARYTWYAWWTRRYAYANNNNTKTTWKACTWTT YKRCCTTANNTAAWYTKS 

A . t . CA-CTCATTATTT ACTGAGGTTAGAGAATAGACTTGCGAATAAACACATTCCCGAGAAAT 

L.a. GAACTGGTTGAGT-CAAAGCGTACCGGACAAATATGTTT-TATATTCTTATTTAAGAATT 
* ** ** * * ** ** * * * * ** ** * * * **** * 

Con . 5 SANCTSRTTRWKTNCWRAGSKTASMGRAYARAYWTGYKWNTAWAYWCWTWYYYRAGAAWT 

-400 

A. t. -4 32 ACTCATGATCCCATAATTAGTCAGAGGGTATG [5|CAAT C AG AT CT AAGAACA 

L.a. AACACTCATCTCATAATTAGTCAGAGGCTAGGGAGATTCAGCCAATCAATGCTAACAACA 

* * *** **************** ** * ******* **** **** 

Con . 5 AMYMMTSATCYCATAATTAGTCAGAGGSTAKGNNNNNNNNNCCAATCARWKCTAASAACA 
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Figure 5 Continued: Alignment of Ax and L.a. FAE1 promoters 

A. £. -381 CACATTCCCTCAA — ATTTTA — ATGCACATGTAATCAT AGTTT AGCA 

L.a. AA-ATTCTCTTAATGATCTAACGATGCT-ATTTAATATTCGGATCAGTATTCTTAAATAA 

* **** ** ** * ★ * * **** ** **** * *** * * * 

Con . 5 MANATTC YCT YAANNATYTWANNATGCTNATKTAATMWTNNNNNNAGTWTNNNNNNAKMA 



A. t. -337 CAATTCAAAAATAATGTAGTA-TTAAAGACAGAAATTTGTA — GACTTTTTT — TTGGCG 

L.a. GAATATAAAACTAATTCAATAGTTACAGATAAAAACTTATATAGACTTTTTTATTTGGAA 

Con . 5 SAATWYAAAAMTAATKYARTANTTAMAGAYARAAAYTTRTANKGACTTTTTTNNTTGGMR 



A. t. -282 T-TAAAGGAA GACTAAGTTTATA-CGT ACATTT-TAT 

L.a. T AT AAAAGT ATCAAT ATAT TAT AG AC AATATT TAT AACGT TAAAAATACAATATT TAT AT 

* **** * * *** * ****** *** * **** **★ 

Con . 5 TNTAAARGWANNNNNNNNNNNNNGACWAWRTTTATANCGTNNNNNNNNNNAYATTTNTAT 

A. t. -247 TTTAAGTGGA AAACCGAAATT — TTCCATCGAAATATATGAATTT - AGTAT A 

L.a. TTTTTATATATTTATTTCAAATTGAAAAGCATTACTTCTATCGAAATGAATTTTAGTATA 
*** * * *** **** ****** * ******** ****** 

Con . 5 TTTWWRTRKANNNNNNNNAAAYYGAAAWKNNTTMCWTCKAWMKAWATGAATTTNAGTATA 

A. C. -198 T ATATTTCTGCAAT-GTACTATTTTGCTATTTTGGCAACTTTCAGTGGACTACT 

L.a. TTAATTAATATTTTTTTAATCGGACTACTTTCCTATTTTGGCACCTTTCATCTGACTACT 



Con . 5 TNNNNNNATATTTYTKYAATNGKACTAYTTTSCTATTTTGGCAMCTTTCAKYKGACTACT 

A. t . - 1 4 5 ACTTTATTACAATGTGTATGGATGCATGAGTTTGAGTA-TACACATGTCTAAATGCATGC 

L.a. AATTTATTTCAATGTGTATGCATGCATGAGCATGAGTAATACACATGTCTATATAAATGC 
* ****** *********** ********* ****** ************ ** **** 

Con . 5 AMTTTATTWCAATGTGTATGSATGCATGAGYWTGAGTANTACACATGTCTAWATRMATGC 

A. t . -86 TTTGCAAAACGTAACGGACCACAAAAGAGGATCCATGCAAATACATCTCATAGCTTCC7C 

L.a. AT-GTAAAACGTAACGGACCACAAAAGTGGATCCATACAAATACATCTCATCGCACCCTC 

Con . 5 WTNGYAAAACGTAACGGACCACAAAAGWGGATCCATRCAAATACATCTCATMGCWYCCTC 

A. t . - 2 6 CATTATTTTCCGACACAAA-CAGAGCA 

L.a. TCCG ACACAAAACTG AACA 

Con . 5 NNNNNNNNTCCGACACAAANCWGARCA 
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Figure 6 



A: FAE1 promoter-castor hydroxylase 
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B: Napin promoter-castor hydroxylase 




Individual transformed lines 
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Figure 7: Alignment of An. and Leu FAE1 promoters 
CLUSTAL W (1.81) multiple sequence alignment 



BnFAEl 
LaFAEl 



GGTTGGGCAAATCTGACTTCACCAAAGAAACAACTCGAGTCGTTATCCATCTCCTCATAA 60 



BnFAEl 
LaFAEl 



CCATCGCTCCACTCTTTGCCTTCACCGTTTTCGGTTCGGTTCTCTACATCGCAACCCGGC 120 



BnFAEl 
LaFAEl 



BnFAEl 
LaFAEl 



CCAAACCGG77TACC7CG77GAG7AC7CA7GC7ACCT7CCACCAACGCA77G7AGA7CAA 180 

CGCCGGGGAG7-77CAGC77AACCGG7AAAA77GGCC7G7ACATA7A 4 6 

* ** **** **• ** * * * * **+* ** * 

GTATCTCCAAGGTCATGGATATCTTTTATCAAGTAAGAAAAGCTGATCCTTCTCGGAACG 240 

TTTACCACTGAGT-AAAGACATCAGTTAATGATTT GTTGTTACTCAATTGGGCT 99 

* * * ** * ** **♦ #♦* * * *••**• * * 



BnFAEl 
LaFAEl 



GCACG7GCGA7GAC7CG7CG7GGC77GAC77C77GAGGAAG A77CAAGAACG77CAGG7C 300 
AAGTGTATTATTATATGTGTTG TATATAATAAAGG7 AGAACG7 — AAA77 1 4 7 



BnFAEl 
LaFAEl 



TAGGCGA7GAAAC7CACGGGCCCGAGGGGC7GC77CAGG7CCC7CCCCGGAAGAC7TTTG 360 
TA--CTAAGAATGTC?ITr7TCCAATG7GATTGCTCrTTGGCCTCTTAGGTTTGAATCCTA 205 



* * *** 



BnFAEl 
LaFAEl 



CGGCGGCGCG7GAAGAGACGGAGCAAGTTATCATTGGTGCGCTAGAAAATCTATTCAAGA 4 2 0 
CT CGAGAAGAC7AA77T7AA7 -77AC7GGCAAAAA7AGAAA7CAA7TTA7AA 256 



BnFAEl 
LaFAEl 



BnFAEl 
LaFAEl 



ACACCAACG77AACCC7AAAGA7A7AGG7ATAC7TG7GG7GAAC7CAAGCA7G7TTAA7C 480 
GTGTT7AAACAAA7C — GATGG7A7AAC7G- A77AG7GA7CACTC77AGG77 — 77GA7C 311 
* *• * * * **** # * * • •• * * ** * ** *** 

CAACTCCA7CGC7C7CCGCGA7GG7CG77AACAC777CAAGC7CCGAAGCAACG7AAGAA 540 
CAAC7CGAG7A77G AG7A77GAACGC777 777TAAA7AAAA7C77GA 358 



BnFAEl 
LaFAEl 



GC777AACC77GG7GGCA7GGG77G7AG7GCCGGCG77A7AGCCA77GA7C7AGCAAAGG 600 

77TT7AAA-77GG777777GAG7AAAAAAG77C77AA7A7777C7C77-7G7777AA7GG 416 
***** «***• »* •* • * * + + * * * * • * ** 



BnFAEl 
LaFAEl 



BnFAEl 
LaFAEl 



AC7TG77GCA7G7CC-A7AAAAA7ACG7A7GC7C77G7GG7GAGCACAGAGAACA7CAC7 65 9 

G77TG7777GCA7777A7AAGC77AA7777TC7AA777AA7-A77T7A7C7A7CA7CG7C 475 
***** * **** »• * * * * * * • « * * •*** 

719 

CG7AAAG777 7A777GGCACAAAC77G7777A C7777C7ACC77A7A 522 



BnFAEl 
LaFAEl 



GGTGGGGCCGC7A7T77GC7C7CCAACAAGCC7GGAGA7CG7AGACGG7CCAAG7ACGAG 77 9 

ATTTGGGA-AC7GG77GAG7CA AAGCG7ACCG GACAAA7A7G7777A7AT7C 573 

* *** ** ** *» *•** * * * *♦*• * * * 



BnFAEl 
LaFAEl 



CTAG77CACACGG77CGAACGCA7ACCGGAGC7GACGACAAGT C T77TCG77GCG7GCAA 839 

-TTAT77A-AGAA77AACAC7CATC7CA7AA77AG7CAGAGGC 7AGGGAGAT7 62 4 

***** ** ** **# * * * •** **** 



BnFAEl 
LaFAEl 



CAAGGAGACGA7GAGAACGGCAAAA7CGGAG7GAG777G7CCAAGGACA7AACCGATG77 899 

CAGCCAA7CAA7GC7AACAACAAAA77C7CT7AA-- 7GA7C7AACGA7GC7A77TAA7AT 682 
** * * *** *** ****** * * * * * ** ** * * * 
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Figure 7 Continued: Alignment of B.tu and La. FAE1 promoters 



BnFAEl 
LaFAEl 



GC7GG7CGAACGGTTAAGAAAAACA7AGOttCG77GGGTCCG77GATTCTTCCGT?A-AG 95 B 

TCGGATCAGTATTCTTAAATAAGAATATAAA ACTAATTCAATAGTTACAG 732 

* * ** ***** *** ** * **** **<** ** 



BnFAEl 
LaFAEl 



CGAGAAACTTC T T XTTTT CGTTACCTTCATGGGCAAGAAACTTT7CAAAGATAAAATCAA 1018 

A7AAAAAC7TA7A7AGACT77T77ATT7G-GAA7ATAAAAG7A7CAATATA7TA7AGACA 791 
* ****•★ • * ** • * * * *** * * * * ** * * * 



BnFAEl 
LaFAEl 



ACATTAC7ACG7CCCGGA77TCAAAC77GC7A7TGACCA77T77G7ATACA7GCCGGAGG 1078 
ATATTTATA ACGTTAAAAATACAATATTTATATTTTTTATATATTTATTTCAAA 84 5 



** ** 



BnFAEl 
LaFAEl 



CAGAGCCGTGATTGATGTGCTAGAGAAGAACCTAGCCCTAGCACCGATCGATGTAGAGGC 1138 
TTGAAAAGCATTACTTCTATCGAAATGAATTTTAGT ATATTAATTAATATTTTTTT 901 



BnFAEl 
LaFAEl 



ATCAAGATCAACGTTACATAGATTTGGAAACACTTCATCTAGCTCAATATGGTATGAGTT 1198 

AATCGGACTACTTTCCTAT TTTGGCACCTTTCATCTGACT ACT 944 

* ****** ***** ******* *• * 



BnFAEl 
LaFAEl 



GGCATACATAGAAGCAAAAGGAAGGATGAAGAAAGGTAATAAAGTTTGGCAGATTGCT7T 1258 
AATTTATTTCAATGTGTATGCATGCATGAGCATGAGTAATA CACATGTCTAT 996 



BnFAEl 
LaFAEl 



AGGGTCAGGC7T7AAGTG7AACAGTGCAGTT7GGGTGGCTCTAAACAATGTCAAAGCTTC 1318 

ATAAATGC ATGTAAAACG7AACGG - ACCACAAAAG7GGA7CCA7ACAAA7 ACATC7CA7C 1055 
* * ** ***** * * **** *♦ * **** *• * •• 



BnFAEl 
LaFAEl 



GACAAATAG7CC77GGGAACAC7GCA7CGACAGATACCCGG7CAAAA77GA77CTGA77C 1378 

G-CACCC7C7CCGACACAAAAC7GAACA 1082 

* ** *** ** *«** * 



BnFAEl 
LaFAEl 



AGG7AAG7CAGAGAC7CG7G7CCAAAACGG7CGG7CC7AA7AAACGA7G7T7GC7C 



1438 



BnFAEl 
LaFAEl 



TCG777CT7777A77TG77A7AA7AA777GATGGCTACGA7G777C7CT7GT77G77ATG 14 98 



BnFAEl 
LaFAEl 



AA7AAAGAA7GCAA7GG7G77C7AG7A777GA77C7777ACA7G7A7G7A7C7C77A777 1558 



BnFAEl 
LaFAEl 



ACATGAAA77777AAACGCC7AAAAAAAAAAACGGAA77CCG 1600 



13/15 



SUBSTITUTE SHEET (RULE 26) 



WO 01/11061 



PCT/CA00/00907 



Figure 8: Alignment of B.n. and Ax FAE1 promoters 
CLUSTAL W (1.81) multiple sequence alignment 



AtFAEl 
BnFAEl 



AtFAEl 
BnFAEl 



AtFAEl 
BnFAEl 



AtFAEl 
BnFAEl 



AtFAEl 
BnFAEl 



AtFAEl 
BnFAEl 



AtFAEl 
BnFAEl 



AtFAEl 
BnFAEl 



AtFAEl 
BnFAEl 



AtFAEl 
BnFAEl 



AtFAEl 
BnFAEl 



AtFAEl 
BnFAEl 



AtFAEl 
BnFAEl 



AtFAEl 
BnFAEl 



AtFAEl 
BnFAEl 



GGTTGGGCAAATCTGACTTCACCAAAGAAACAACTCGAGTCGTTATCCATCTCCTCATAA 60 
CCATCGCTCCACTCTTTGCCTTCACC GTTTT CGGTTCGGTTCTCTACATCGCAACCCGGC 12 0 

CCAAACCGGTTTACCTCG7TGAGTACTCATGCTACCTTCCACCAACGCATTGTAGATCAA 180 

— — — — CATAAAA 10 

GTATCTCCAAGGTCATGGATATCTTTTATCAAGTAAGAAAAGCTGATCCTTCTCGGAACG 240 

**• ** 

ACTAGTAGATTGGTTGGT— TGGTTTCCATGTACCAGAAGGCTT ACCCTATTAGT 63 

GCACGTGCGATGACTCGTCGTGGCTTGACTTCTTGAGGAAGATTCAAGAACGTTCAGGTC 300 

* ** ** * ** *»* ** * ** * * ** ** * 

TGAAAGTTGAAACTT-TGTTCCCTACT — CAATTCCTAGT7GTGTAAATGTATGTATATG 12 0 

TAGGCGATGAAACTCACGGGCCCGAGGGGCTGCTTCAGGTCCCTCCCCGGAAGACTTTTG 360 

* * ******* « *** * * * • *« * * * ** 

TAATG-CGTATAAAACGTAGTACTTAAATGACTAGGAGTGGTTCTTGAGACCGATGAGAG 17 9 

CGGCGGCGCGTGAAGAGACGGAGC-AAGTTATCATTGGTGCGCTAGAAAATCTATTCAAG 419 
* ** * ** * * * ** * * * *** * * * ** ** 

A TGGGAGCAGAACTAAAGATGATGACATAATTAAGAACGAATTTGAAAGGCTCTT A 235 

AACACCAACGTTAACCCTAAAGATATAGGTATACTTGTGG-TGAACTCAAGCATGTTTAA 478 

* * **♦***■*♦ « *** *• * *** • * * * * 

GGTTTGAATCCTATTCGAGAATGTTTTTGTCAAAGATAGTGGCGA-TTTTGAACCAAAGA 294 

TCCAACTCCATCGCTCTCCGCGATGGTCGT7AACACTTTCAAGCTCCGAAGCAACGT 535 

****** * * *** # * * * • * **• * * * * 

AAACAT7TAAAAAATCAGTATCC — GGTTAC-GTTCATGCAA-ATAGAAAGTGG7CTAGG 350 
AAGAAGCTTTAACCTTGGTGGCATGGGT7GTAG7GCCGGCG7TATAGCCATTGATCTAGC 595 

********** **•* « « * • « *«*• * ** ***** 

ATCTGA7TG7AATTTTAGACTTAAAGAGTCTCTTAAGATTCAATCCTGGCTGTGTACAAA 410 
AAAGGACTT — GTTGCATGTCCATAAAAATACGTATGCTCTTGTGGTGAGCACAGAGAAC 653 

* ** * ** * * * * * ** * * * ** * ♦ * 

ACTACAAATAATATAT TTTAGACTATTTGGCCTTAACTAAACTTCCACTCATTATTT 4 67 

ATCACTTATAACATTTACGCTGGTGATAATAGG7CCATGATGGTTTCAAATTGCTT G T T C 713 

* ** **** *• * * ****** * ♦ ****** 

ACTGAGGTTAGAGA-ATAGACTTGCGAATAAACACATTCCCGAGAAATACTCATGATCCC 52 6 

CGTGTTGGTGGGGCCGCTATTTTGC7CTCCAACAAG— CCTGGAGATCGTAGACGGTCCA 771 
** * * * * **** **** ** * * * * *** 

CE3 

ATAATTAGTCAGAGGGTATG — CCAATCAGATCTAAGAA QACACATTCCCTQ ftAATTTTA 584 

AGTACGAGCTAGTTCACACGGTTCGAACGCATACCGGAGCTGACGACAAGTCTTTTCGTT 831 
****** * * ***** ***** ** * * 

ATGCACATGTAATCATAGTTTAGCACAATTCAAAAATAATGTAGTATTAAAGACAGAAAT 644 

GCGTGCAACAAGGAGACGATGAGAACGGCAAAATCGGAGTGAGT7TGTCCAAGGACATAA 891 

* ** * ****** ** * ** * * * * * * 
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Figure 8 Continued: Alignment of Bjl and A.L FAE1 promoters 



AtFAEl 
BnFAEl 



TTGTAGACTTTTTTTTGGCGTTAAAGGAAGACTAAG ~ TTTATACGTACATTTTAT 698 

CCGATGTTGCTGGTCGAACGGTTAAGAAAAACATAGCAACGTTGGGTCCGTTGATTCTTC 9S1 
* * * * ** • * ** ** ** ** * * *** *•• « 



AtEAEl 
BnFAEl 



AtFAEl 
BnFAEl 



T-TTAAGTGGAAAACCGAAATTTTCCAT — CGAAATATATGAATTTAGTATATATA 751 

CGTTAAGCGAGAAACTTCTTTTTTTCGTTACCTTCATGGGCAAGAAACTTTTCAAAGATA 1011 
***** * **** **** * * * ******* *** 

G box 2 



TTTCTGCAATGTACTATTTTGCTATTTTGG CAACTOT CAGTGGACTACTACTTTAT-TAC 810 
AAATCAAACATTACTACGTCCCGGATTTCA-AACTTGCTATTGACCATTTTTGTATACAT 1070 



***** * * *•* 



*** * 



AtFAEl 
BnFAEl 



G-box 1 



AATGTGTATGGATGCATGAGTT -TGAGTATA fcAC AT GT TCTAAATGCATGCTTTGCAAAAC 869 

GCCGGAGGCAGAGCCGTGATTGATGTGCTAGAGAAGAACCTAGCCCTAGCACCGATCGAT 1130 
* #* * **• * ** * ******* * 



AtFAEl 
BnFAEl 



GTAACGG-ACCACAAAAGAGGATCCAT GCAAATACATCTCATAGCTTCCTCCAT 922 

GTAGAGGCATCAAGATCAACGTTACATAGATTTGGAAACACTTCATCTAGCTCAATATGG 1190 
*** ***** * * * * *** * *** ** ** ***** * 



AtFAEl 
BnFAEl 



TATTTTCCGACACAAACAGA-GCA 94 5 

TATGAGTTGGCATACATAGAAGCAAAAGGAAGGATGAAGAAAGG7AATAAAGTTTGGCAG 1250 
* * * ***** *** *** 



AtFAEl 
BnFAEl 



ATTGCTTTAGGGTCAGGCTTTAAGTGTAACAGTGCAGTTTGGGTGGCTCTAAACAATGTC 1310 



AtFAEl 
BnFAEl 



AAAGCTTCGACAAATAGTCCTTGGGAACACTGCATCGACAGATACCCGGTCAAAATT6AT 1370 



AtFAEl 
BnFAEl 



TCTGATTC AGGTAAGTCAGAGACTCGTGTCCAAAACGGTCGGTCCTAATAAACGATGTTT 1430 



AtFAEl 
BnFAEl 



GCTCTCTTTCGTTTCTTTTTATTTG77ATAATAATTTGATGGCTACGATG77TC7CTTGT 14 90 



AtFAEl 
BnFAEl 



TTGTTATGAATAAAGAATGCAATGGTGTTCTAGTATTTGATTGTTTTACATGTATGTATC 1550 



AtFAEl 
BnFAEl 



TCTTATTTACATGAAATTTTTAAACGCC7AAAAAAAAAAACGGAATTCCG 1600 
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